FOUNDED BY THE BIOMETRICS SECTION OF THE AMERICAN STATISTICAL ASSOCIATION


TABLE OF CONTENTS


A Model for the Deterioration in Strength of Materials Due to Fungal Attack
J. S. Maritz

Optimum Group Size in Progeny Testing and Family Selection
Alan Robertson

Partition of Experimental Vectors Connected with Multinomial Distributions
L. C. A. Corsten

The Distribution of Red Blood Cells in the Hemacytometer
Malcolm E. Turner and G. S. Eadie

The Comparison of the Sensitivities of Similar Experiments: Applications
R. A. Bradley and D. E. W. Schumann

Error Rates and Sample Sizes for Range Tests in Multiple Comparisons
H. Leon Harter

Queries and Notes

Some Simple Methods of Computing Parameters in the Analysis of Variance
William G. Madow

A Routine for Computing the Degrees of Freedom in Analysis of Variance
Bertil Matérn

Corrections

[illegible]
E. A. Cornish

Abstracts

The Biometric Society

News and Announcements

Contents of and Index to Volume 13


Number 4 December 1957 Volume 13 


THE BIOMETRIC SOCIETY 
General Officers 


President, E. A. Cornish; Secretary, M. J. R. Healy; Treasurer, A. W. Kimball; 
Council, F. J. Anscombe, Claudio Barigozzi, L. L. Cavalli-Sforza, W. G. Cochran, 
G. M. Cox, Georges Darmois, B. B. Day, D. J. Finney, J. H. Gaddum, M. P. Gep- 
pert, Americo Groszmann, P. C. Mahalanobis, Donald Mainland, Leonard Martin, 
Motosaburo Masuyama, P. A. P. Moran, Jerzy Neyman, C. R. Rao, E. J. Williams, 
Frank Yates, W. J. Youden. 


Regional Officers 


Eastern North American Region: Regional President, B. Harshbarger; Secre- 
tary-Treasurer, A. M. Dutton. British Region: Regional President, D. J. Finney; 
Secretary, C. C. Spicer; Treasurer, A. R. G. Owen. Western North American Region: 
Regional President, D. G. Chapman; Secretary-Treasurer, Elizabeth Vaughan.
Australasian Region: Regional President, E. J. Williams; Secretary, G. S. Watson; 
Treasurer, G. A. McIntyre. French Region: Regional President, Eugene Morice; 
Secretary-Treasurer, Daniel Schwartz. Belgian Region: Regional President, R. 
Laurent; Secretary, Leopold Martin; Treasurer, A. H. L. Rotti. Italian Region: 
Regional President, Gustavo Barbensi; Secretary, L. L. Cavalli-Sforza; Treasurer, 
R. E. Scossiroli. German Region: Regional President, O. Heinisch; Secretary-Treas- 
urer, Wilhelm Ludwig. Brazilian Region: Regional President, C. G. Fraga, Jr.; 
Secretary, P. Mello Freire; Treasurer, A. Groszmann. 


National Secretaries 


Denmark, N. F. Gjeddebaek; India, K. Kishen; Japan, M. Hatamura; The 
Netherlands, E. van der Laan; Sweden, H. A. O. Wold; Switzerland, H. L. LeRoy.


Editorial Board 
Biometrics 


Editor: Ralph A. Bradley; Editorial Associates and Committee Members: C. I. 
Bliss, Irwin Bross, E. A. Cornish, S. Lee Crump, W. J. Dixon, Mary Elveback,
J. W. Hopkins, O. Kempthorne, Leopold Martin, Horace W. Norton, G. W. Snedecor 
and Georges Teissier. Managing Editor: Ralph A. Bradley. 


The Biometric Society is an international society devoted to the mathematical 
and statistical aspects of biology. Biologists, mathematicians, statisticians and others 
interested in its objectives are invited to become members. Through its regional 
organizations the Society sponsors regional and local meetings. National secretaries 
serve the interests of members in Denmark, India, Japan, the Netherlands, Sweden 
and Switzerland, and there are many members at large. 


Rates (in U.S.A. currency) for full membership in the Society for 1957 including dues and a
subscription to this journal are: for residents of Canada and the United States $7.00 (Student Mem- 
bership $4.00); for others $4.50 (Student Membership $2.75). Members of the American Statistical 
Association who are currently subscribing to Biometrics through that organization may become 
members of the Biometric Society on payment of $3.00 annual dues if resident in the United States 
or Canada, and of $1.75 annual dues if resident elsewhere. Information concerning the Society can 
be obtained from its Secretary, M. J. R. Healy, Statistics Department, Rothamsted Experimental 
Station, Harpenden, Herts, England. 

Annual subscription for non-members of the Biometric Society is $7.00, payable to the Managing 
Editor, Biometrics, Dept. of Statistics, Virginia Polytechnic Institute, Blacksburg, Virginia.


Second-class mailing privileges authorized at Knoxville, Tenn. Additional entry 
at Richmond, Va. Business Office: 509 West Hill Road, Knoxville 19, Tennessee. 
Biometrics is published quarterly—in March, June, September and December. 




A MODEL FOR THE DETERIORATION IN STRENGTH OF 
MATERIALS DUE TO FUNGAL ATTACK 


J. S. MARITZ


Transvaal and Orange Free State Chamber of Mines Research Laboratories 
Johannesburg, South Africa 




INTRODUCTION 


Many materials used by industries are subject to fungal attack 
and the conditions to which they are subjected are frequently favourable 
to the growth of fungi. The result of fungal attack is usually that the 
material gradually loses its strength and eventually disintegrates; but 
it may actually become useless long before such an advanced stage 
is reached. On the other hand, due to slow decrease in strength the 
material may serve its purpose for a long time even though it is being 
attacked by fungi. Laboratory studies of the rate of decrease in strength 
under different conditions are, therefore, of considerable importance. 

Development of a mathematical model for the loss of strength due 
to fungal attack seems desirable for at least two reasons. They are: 


(i) the results of the laboratory experiments are usually observations 
on a number of random variables and a mathematical model 
provides a description and summary of a set of data which can 
be very useful, particularly when different sets are to be com- 
pared with each other. 

(ii) as many of these investigations are rather lengthy, a sound
model can be of great use in the efficient planning of experiments.


The basic assumptions on which the model in this paper rests are: 
if a specimen of material is exposed (to fungal attack), a certain period 
of time will elapse before the attack actually starts and, once it starts, 
the strength decreases gradually to zero. Both the time of ‘survival’ 
and the rate at which the strength decreases may be random variables. 

A specific example in which this approach has been successful is
encountered on the South African gold and coal mines, where various
fabrics are used underground for dust filtering and other purposes.
An example of a laboratory experiment with Cotton Duck, in which
tensile strengths of pieces of standard shape and size were measured,
is given in a later section.


MATHEMATICAL MODEL: GENERAL FORM 


Suppose that a number of sample pieces from the same batch of a
material are exposed to fungal attack under conditions as nearly identical
as possible. The attack on individual samples will actually start after
time intervals $t_1, t_2, \ldots$, where $t = 0$ corresponds to the start of the
experiment. Each $t_i$ may be regarded as a realization of a random
variable $t$ with probability density $g(t)\,dt$, so that the expected
proportion of samples remaining with no attack after time $t$ is

$$1 - \int_0^t g(\tau)\,d\tau = 1 - G(t).$$


The initial (i.e. unexposed) strength will vary from one sample to
another, but, for the moment, we consider only samples with the same
initial strength $x_0$. Take one on which attack starts at time $t_1$, i.e.
during the interval $t_1 \pm \tfrac{1}{2}\delta t_1$. At $t_2$ ($t_2 > t_1$), due to the attack the
strength has decreased to $x_0 f(t_2 - t_1)$. This function $f(t_2 - t_1)$ must
satisfy two conditions:

(a) $f(t_2 - t_1) = 1$ for $t_2 = t_1$;
(b) $f(t_2 - t_1) \to 0$ as $t_2$ increases.


The form of $f$ may itself be a random variable, but we make the
simplifying assumption that it remains fixed and that its value for fixed
$(t_2 - t_1)$ is a random variable only by virtue of its dependence on a
constant $a$ which is a random variable. This means that the samples on
which attack starts at $t_1$ will at $t_2$ have strengths
$x_0 f(t_2 - t_1, a_1),\ x_0 f(t_2 - t_1, a_2),\ \ldots$ with expected mean value

$$E_a[x_0 f(t_2 - t_1, a)] = x_0 E_a[f(t_2 - t_1, a)].$$


We are now able to calculate the expected mean value at time $t$
of all samples with initial strength $x_0$ on the assumption that the random
variables $x$ and $t$ are independent: the total proportion of attacked
samples after time $t$ is $G(t)$. The proportion of these attacked during
the interval $\tau \pm \tfrac{1}{2}\delta\tau$ is $g(\tau)\,\delta\tau/G(t)$, and the mean strength of these at
time $t$ $(> \tau)$ is $x_0 E_a[f(t - \tau, a)]$. Therefore the mean of all attacked
samples at time $t$ is

$$\frac{x_0}{G(t)} \int_0^t E_a[f(t - \tau, a)]\, g(\tau)\, d\tau,$$

and it follows that the overall mean at $t$ is

$$x_0 \int_0^t E_a[f(t - \tau, a)]\, g(\tau)\, d\tau + x_0[1 - G(t)]. \tag{1}$$

Assuming that the random variables $a$ and $x$ are independent, we find
that the expected mean strength at time $t$ is

$$E(y_t) = E(x) \int_0^t E_a[f(t - \tau, a)]\, g(\tau)\, d\tau + E(x)[1 - G(t)], \tag{2}$$

where $E(x)$ is the expected mean strength of unexposed samples.


MATHEMATICAL MODEL: SPECIAL FORMS


The special forms which the functions $g(t)$ and $E_a[f(t_2 - t_1, a)]$
may assume are, of course, numerous and we will give only two special
forms of $E(y_t)$ resulting from what appear to be reasonable assumptions
about the two functions.

The function $g(t)$. The distribution of $t$ must be continuous, starting
at $t = 0$ and covering the range $0$ to $\infty$. A simple and plausible choice
for $g(t)$ is

$$g(t) = \beta e^{-\beta t}. \tag{3}$$


This distribution has only one parameter, $\beta$, with the 'physical' meaning
that the mean time of survival before attack starts is $1/\beta$. We were
influenced in this choice of $g(t)$ by the knowledge that this type of
distribution has been found to fit a large variety of 'life' distributions,
e.g., life of electron tubes [Epstein 1954].

The function $E_a[f(t_2 - t_1, a)]$. In considering this function we
must really take account of two functions, i.e. $f$ and the probability
density $p(a)$ of the random variable $a$. We have examined various
forms of $f$ and found

$$f(t_2 - t_1, a) = e^{-a(t_2 - t_1)} \tag{4}$$

one of the most easily handled and also practically successful.

The probability distribution of $a$. Unless fairly simple forms for
$p(a)$ are used in conjunction with (4), the resulting expression for
$E_a[f(t_2 - t_1, a)]$ can be very awkward indeed. Following are two
examples of special forms of $E(y_t)$ obtained by using (3), (4), and two
very simple forms of $p(a)$:


Example (1)

We note that substituting (3) and (4) in (2) gives the result

$$E(y_t) = E(x) \int_0^\infty \frac{\beta\,(e^{-\beta t} - e^{-a t})}{a - \beta}\, p(a)\, da + E(x)\, e^{-\beta t}. \tag{5}$$

If we put

$$p(a) = \delta(a - \alpha), \tag{6}$$

the Dirac delta function, i.e. $a$ assumes the value $\alpha$ with probability
1, (5) becomes

$$E(y_t) = \frac{E(x)}{\alpha - \beta}\left[\alpha e^{-\beta t} - \beta e^{-\alpha t}\right]. \tag{7}$$

This is a simple expression which may readily be 'fitted' to observed
data, and we have found it to be applicable to many of our laboratory
experimental results. An example will be given in a later section.
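
As a rough numerical check on equation (7), the following Python sketch (not part of the original paper) compares the closed form with a direct midpoint-rule evaluation of (2) under the choices (3), (4) and (6). The function names `theta` and `theta_numeric` are illustrative assumptions, and the parameter values 1.30 and 1.15 are simply those reported in the practical example later in the paper.

```python
import math


def theta(alpha, beta, t):
    """Closed form (7) divided by E(x): expected fraction of the initial
    strength remaining at time t."""
    if math.isclose(alpha, beta):
        # Limiting form of (7) when beta equals alpha.
        return math.exp(-alpha * t) * (1.0 + alpha * t)
    return (alpha * math.exp(-beta * t) - beta * math.exp(-alpha * t)) / (alpha - beta)


def theta_numeric(alpha, beta, t, steps=20000):
    """Equation (2) divided by E(x), with g(tau) = beta*exp(-beta*tau) and
    f = exp(-alpha*(t - tau)), integrated by the midpoint rule."""
    h = t / steps
    attacked = 0.0
    for k in range(steps):
        tau = (k + 0.5) * h
        attacked += math.exp(-alpha * (t - tau)) * beta * math.exp(-beta * tau) * h
    unattacked = math.exp(-beta * t)  # this is 1 - G(t)
    return attacked + unattacked


if __name__ == "__main__":
    for t in (0.5, 1.0, 3.0):
        print(t, theta(1.30, 1.15, t), theta_numeric(1.30, 1.15, t))
```

The two columns printed for each time should agree to several decimal places, which is only a sanity check on the algebra and says nothing about how well the model fits data.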


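Example (2)*


If we assume, instead of (6), that the distribution of $a$ is located
at $\alpha$ and has a small variance $\sigma^2$, expression (2) may be written
approximately as

$$E(y_t) \approx E(x) \int_0^t \left\{ f(t - \tau, \alpha) + \frac{\sigma^2}{2} \left[ \frac{\partial^2 f(t - \tau, a)}{\partial a^2} \right]_{a = \alpha} \right\} g(\tau)\, d\tau + E(x)[1 - G(t)], \tag{8}$$

and substituting (3) and (4) we have

$$E(y_t) \approx \frac{E(x)}{\alpha - \beta}\left(\alpha e^{-\beta t} - \beta e^{-\alpha t}\right) + \frac{E(x)\,\sigma^2 \beta}{(\alpha - \beta)^3}\left\{ e^{-\beta t} - e^{-\alpha t}\left[1 + (\alpha - \beta)t + \tfrac{1}{2}(\alpha - \beta)^2 t^2\right] \right\}. \tag{9}$$

This is a more flexible expression than (7), as can be seen more
clearly if we consider the special case $\beta = \alpha$ which gives for (7) and
(9) respectively,

$$E(y_t) = E(x)\, e^{-\alpha t}(1 + \alpha t)$$

and

$$E(y_t) = E(x)\, e^{-\alpha t}\left(1 + \alpha t + \tfrac{1}{6}\alpha \sigma^2 t^3\right).$$

As the form (7) appears, despite its simplicity, to be of real practical
use, the following section gives some detail about estimating its
parameters.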
*I would like to acknowledge my indebtedness to one of the referees for drawing my attention
to this example.


METHODS OF ESTIMATING THE PARAMETERS IN EQUATION (7)


The parameters are $E(x) = \xi_1$, $\alpha$ and $\beta$. As our main concern so
far has been to see whether the model fits our observations at all, we
have not gone into the question of estimation very carefully and can
suggest only some crude methods of estimating the parameters.

Considering the way in which the model is built up, it is clear that
in some circumstances $\xi_1$ and $\beta$ may be estimated without using the
observed regression of $y_t$ on $t$.

Estimate of $\xi_1$. By definition $\xi_1$ is the expected mean value of
unexposed unattacked samples so that if we have $N$ such samples with
strengths $x_1, \ldots, x_N$ an estimate of $\xi_1$ is simply

$$\hat\xi_1 = \sum_i x_i / N. \tag{10}$$

As it is usually true that $N$ can be made large without much trouble
or expense this procedure may be recommended.

Estimate of $\beta$. Sometimes it is possible (e.g. by microscopic
examination) to discriminate between attacked and unattacked samples
before they are, as is often the case, tested to destruction. When this
can be done, a series $p_1, p_2, \ldots$ of observed proportions of attacked
samples at times $t_1, t_2, \ldots$ is found. According to (3)

$$E(p_i) = 1 - e^{-\beta t_i}. \tag{11}$$

Fitting the expression (11) through the observed $p_i$ by, for example,
the maximum likelihood method gives an estimate of $\beta$.
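
The maximum likelihood fit of (11) can be sketched as follows. This is a minimal Python illustration under an added assumption that the paper does not spell out: the number of attacked samples among those examined at time $t_i$ is treated as binomial with probability $1 - e^{-\beta t_i}$. The function name `beta_mle` and the counts used in the usage lines are invented for illustration.

```python
import math


def beta_mle(times, attacked, examined, lo=1e-6, hi=50.0, iters=200):
    """Maximum likelihood estimate of beta, assuming (hypothetically) that the
    count attacked at time t_i is binomial with probability 1 - exp(-beta*t_i).
    The score function is decreasing in beta, so its root is bracketed and
    located by bisection."""
    def score(beta):
        s = 0.0
        for t, k, m in zip(times, attacked, examined):
            q = math.exp(-beta * t)  # probability of no attack by time t
            s += k * t * q / (1.0 - q) - (m - k) * t
        return s

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if score(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Invented counts: 100 samples examined at each of three times.
    print(beta_mle([1.0, 2.0, 3.0], [39, 63, 78], [100, 100, 100]))
```

If every examined sample is already attacked, the likelihood increases without bound in $\beta$ and the routine simply returns its upper search limit, so such data cannot be used for this estimate.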

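Estimate of $\alpha$ when $\beta$ and $\xi_1$ are known. A weighted least squares
procedure appears to be advisable. If the observed values of $y_t$ at $t_i$
$(> 0)$ are $y_{i1}, \ldots, y_{in_i}$ and the weights are $w_1, w_2, \ldots$, we minimize

$$S_1 = \sum_i w_i \left[ \bar y_{i\cdot} - \frac{\xi_1}{\alpha - \beta}\left(\alpha e^{-\beta t_i} - \beta e^{-\alpha t_i}\right) \right]^2 \tag{12}$$

with respect to $\alpha$, where $\bar y_{i\cdot} = \sum_j y_{ij}/n_i$, and the result is

$$\sum_i w_i \left(\bar y_{i\cdot} - \xi_1 Y_i\right) \frac{\partial Y_i}{\partial \alpha} = 0, \tag{13}$$

where $\xi_1 Y_i = E(y_{t_i})$.

If $\xi_1$ and $\beta$ are unknown but estimated as above, those estimates
may be substituted in (13), which procedure will lead to a consistent
estimate of $\alpha$.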

Joint estimation of $\alpha$ and $\beta$ by weighted least squares. When a separate
estimate of $\beta$ cannot be obtained, $\alpha$ and $\beta$ may be estimated jointly
by minimizing $S_1$ [see equation (12)] with regard to both $\alpha$ and $\beta$.
One difficulty is, however, that $\alpha$ and $\beta$ are interchangeable in the
expression (7). It may be resolved by making use of the fact that

$$E(y_t^2) = \frac{\xi_2}{2\alpha - \beta}\left[2\alpha e^{-\beta t} - \beta e^{-2\alpha t}\right] = \xi_2\, \theta(2\alpha, \beta, t), \tag{14}$$

where $\xi_2 = E(x^2)$ and $\theta(\alpha, \beta, t) = (\alpha e^{-\beta t} - \beta e^{-\alpha t})/(\alpha - \beta)$; (14) may be
derived in the same way as the value of $E(y_t)$. For, it may be shown
that a consistent estimate of $\alpha$ is obtained if, of the two possible
estimates of $\alpha$, that one is chosen which gives the smaller difference,
in some sense, between $\xi_2\theta(2\hat\alpha, \hat\beta, t_i)$ and the observed values
$\sum_j y_{ij}^2/n_i$. Even a crude rule such as: choose $\alpha = \hat\alpha$ or $\hat\beta$ according
to whether $u \le v$ or $u > v$, where

$$u = \sum_i \left[ \sum_j y_{ij}^2/n_i - \xi_2\, \theta(2\hat\alpha, \hat\beta, t_i) \right]^2, \qquad v = \sum_i \left[ \sum_j y_{ij}^2/n_i - \xi_2\, \theta(2\hat\beta, \hat\alpha, t_i) \right]^2, \tag{15}$$

can be satisfactory.

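Standard errors of weighted least squares estimates of $\alpha$ and $\beta$. By
argument similar to that used, for example, by Kendall [Kendall,
1948, p. 208] to calculate the standard errors of functions of moments,
one can show that the variances and covariances of least squares
estimates $\hat\alpha$ and $\hat\beta$ of $\alpha$ and $\beta$ are approximately given by

$$\operatorname{cov}(\hat\alpha, \hat\beta) = (A^{-1})\, B\, (A^{-1})', \tag{16}$$

where

$$A = \xi_1^2 \begin{pmatrix} \sum_i w_i \left(\dfrac{\partial Y_i}{\partial \alpha}\right)^2 & \sum_i w_i \dfrac{\partial Y_i}{\partial \alpha}\dfrac{\partial Y_i}{\partial \beta} \\[2ex] \sum_i w_i \dfrac{\partial Y_i}{\partial \alpha}\dfrac{\partial Y_i}{\partial \beta} & \sum_i w_i \left(\dfrac{\partial Y_i}{\partial \beta}\right)^2 \end{pmatrix},$$

$$B = \xi_1^2 \begin{pmatrix} \sum_i w_i^2 \sigma_i^2 \left(\dfrac{\partial Y_i}{\partial \alpha}\right)^2 & \sum_i w_i^2 \sigma_i^2 \dfrac{\partial Y_i}{\partial \alpha}\dfrac{\partial Y_i}{\partial \beta} \\[2ex] \sum_i w_i^2 \sigma_i^2 \dfrac{\partial Y_i}{\partial \alpha}\dfrac{\partial Y_i}{\partial \beta} & \sum_i w_i^2 \sigma_i^2 \left(\dfrac{\partial Y_i}{\partial \beta}\right)^2 \end{pmatrix},$$

$$\xi_1 Y_i = \frac{\xi_1}{\alpha - \beta}\left(\alpha e^{-\beta t_i} - \beta e^{-\alpha t_i}\right) = \xi_1\, \theta(\alpha, \beta, t_i),$$

and $\sigma_i^2 = \operatorname{var}(\bar y_{i\cdot})$.
If $w_i$ is chosen equal to $1/\sigma_i^2$, the result is simply

$$\operatorname{cov}(\hat\alpha, \hat\beta) = A^{-1}. \tag{17}$$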
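To take account of the variance of the estimate $\hat\xi_1$ of $\xi_1$, the elements
of $B$ must be increased by amounts

$$\xi_1^2 \left( \sum_i w_i Y_i \frac{\partial Y_i}{\partial \alpha} \right)^2 \operatorname{var}(\hat\xi_1), \text{ etc.}$$

Finally, the variance of $\hat E(y_t)$ is approximately (for $\xi_1$ known) given by

$$\operatorname{var}[\hat E(y_t)] = \xi_1^2 \left( \frac{\partial Y}{\partial \alpha}, \frac{\partial Y}{\partial \beta} \right) A^{-1} \left( \frac{\partial Y}{\partial \alpha}, \frac{\partial Y}{\partial \beta} \right)'. \tag{18}$$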


PRACTICAL EXAMPLE 


Standard size pieces of Cotton Duck treated with copper naphthenate 
were exposed to attack by a test fungus (T.R.L. 255 Chaetomium 
Globosum). Altogether 30 pieces were exposed and by random choice 
they were tested for tensile strength in groups of six each after 13, 26, 
39, 52, and 65 days of exposure. The tensile strengths of ten unexposed 
pieces were measured. The individual tensile strength values (in 
arbitrary units) are given in Table 1. 


TABLE 1
TENSILE STRENGTHS OF COTTON DUCK


              Period of exposure in arbitrary time units

Unexposed       1       2       3       4       5
  100.5       76.0    24.2    13.0     1.5     0.0
   88.2       67.3    33.3     3.5     0.0     0.0
  107.5       66.5    28.3    13.0     4.8     0.0
  102.5       58.6    37.3     8.3     2.5     2.5
   96.0       67.3    34.5     5.0     0.5     0.0
  107.8       74.8    22.0     5.0     5.8     2.0
   94.0
  103.5
   98.9
   99.0


All the pieces examined after 13 days of exposure were attacked
so that these data were not considered suitable for estimating $\beta$ as
suggested by equation (11). As estimate of $\xi_1$, the mean of the
unexposed pieces was used, i.e. $\hat\xi_1 = 99.79$. Estimates of $\alpha$ and $\beta$ were
then found by minimizing $S_1$ with respect to $\alpha$ and $\beta$, where $\hat\xi_1$ was
substituted for $\xi_1$ and the weights $w_i$ were taken as $w_i = n_i/s_i^2$, where
$s_i^2$ is the observed variance of the strengths at time $t_i$. The result was
that the two values of $\hat\alpha$ and $\hat\beta$ were 1.30 and 1.15. The observations
and the graph $\hat\xi_1\theta(1.30, 1.15, t)$ are plotted in the accompanying figure
and it is clearly a very good fit. A plot of the squares of the observations
and the curves $\hat\xi_2\theta(2.60, 1.15, t)$ and $\hat\xi_2\theta(1.30, 2.30, t)$ showed by inspection
that we should take $\hat\alpha = 1.30$, $\hat\beta = 1.15$.
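
The comparison between the observations and the fitted curve can be reproduced roughly as below. This Python sketch is an illustration rather than the author's computation: the data are transcribed from Table 1, the first value in the third exposure column is assumed to be 13.0 because the scan is unclear there, and the names `theta`, `fitted` and `weight` are hypothetical.

```python
import math
from statistics import mean, variance

# Tensile strengths transcribed from Table 1 (arbitrary units).
unexposed = [100.5, 88.2, 107.5, 102.5, 96.0, 107.8, 94.0, 103.5, 98.9, 99.0]
exposed = {
    1: [76.0, 67.3, 66.5, 58.6, 67.3, 74.8],
    2: [24.2, 33.3, 28.3, 37.3, 34.5, 22.0],
    3: [13.0, 3.5, 13.0, 8.3, 5.0, 5.0],  # first entry assumed to be 13.0
    4: [1.5, 0.0, 4.8, 2.5, 0.5, 5.8],
    5: [0.0, 0.0, 0.0, 2.5, 0.0, 2.0],
}


def theta(alpha, beta, t):
    """Bracketed factor of equation (7), valid for alpha != beta."""
    return (alpha * math.exp(-beta * t) - beta * math.exp(-alpha * t)) / (alpha - beta)


xi1_hat = mean(unexposed)  # estimate (10) of the mean unexposed strength


def fitted(t, alpha=1.30, beta=1.15):
    """Fitted mean strength at time t using the reported estimates."""
    return xi1_hat * theta(alpha, beta, t)


def weight(values):
    """Weight n_i / s_i^2 used in the weighted least squares criterion."""
    return len(values) / variance(values)


if __name__ == "__main__":
    print("unexposed mean:", round(xi1_hat, 2))
    for t, values in exposed.items():
        print(t, round(mean(values), 2), round(fitted(t), 2), round(weight(values), 3))
```

Each printed row gives the exposure period, the observed group mean, the fitted value and the weight, so the quality of the fit can be judged without the figure.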


The variances and covariances of $\hat\alpha$ and $\hat\beta$ calculated as suggested
above turned out to be

$$\operatorname{cov}(\hat\alpha, \hat\beta) \approx \begin{pmatrix} 1.323 & -1.003 \\ -1.003 & 0.763 \end{pmatrix}.$$
These values are surprisingly large, due, we believe, to the fact that in
this example $\hat\alpha$ is very nearly equal to $\hat\beta$. However, due to the high
negative correlation between $\hat\alpha$ and $\hat\beta$, using equation (18) shows that
$E(y_t)$ may be estimated with reasonable precision. For example at
$t = 1$, s.d. $\hat E(y_t) = 2.12$. Of course, bringing into the calculation
the variance of $\hat\xi_1$ increases this standard deviation and the result is
s.d. $\hat E(y_t) = 3.18$. At $t = 5$ the two standard deviations are respectively
0.28 and 0.32. No details are given of the calculation incorporating
var $(\hat\xi_1)$ because normally it is easy to make $N$ so large that var $(\hat\xi_1)$ may
be made negligible.
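
The standard deviation quoted at $t = 1$ can be checked from the printed covariances. The Python sketch below is an illustration and not the author's own calculation: it applies the usual first-order (delta method) approximation, treating $\xi_1$ as known, with the partial derivatives of $Y = (\alpha e^{-\beta t} - \beta e^{-\alpha t})/(\alpha - \beta)$ worked out by hand. The function names are hypothetical.

```python
import math


def d_theta(alpha, beta, t):
    """Partial derivatives of Y = (alpha*exp(-beta*t) - beta*exp(-alpha*t)) / (alpha - beta)
    with respect to alpha and beta (valid for alpha != beta)."""
    d = alpha - beta
    ea, eb = math.exp(-alpha * t), math.exp(-beta * t)
    dy_dalpha = beta * (ea * (1.0 + d * t) - eb) / d ** 2
    dy_dbeta = alpha * (eb * (1.0 - d * t) - ea) / d ** 2
    return dy_dalpha, dy_dbeta


def sd_fitted_mean(xi1, alpha, beta, cov, t):
    """First-order standard deviation of the fitted mean strength at time t,
    treating xi1 as known; cov is the 2x2 covariance matrix of the estimates."""
    ga, gb = d_theta(alpha, beta, t)
    var = xi1 ** 2 * (cov[0][0] * ga * ga + 2.0 * cov[0][1] * ga * gb + cov[1][1] * gb * gb)
    return math.sqrt(var)


if __name__ == "__main__":
    cov = [[1.323, -1.003], [-1.003, 0.763]]
    print(round(sd_fitted_mean(99.79, 1.30, 1.15, cov, 1.0), 2))
```

With the printed values this gives about 2.12 at $t = 1$. Because the result depends on near-cancellation between large terms, it is sensitive to the three-decimal rounding of the covariances, so values at other times need not match the text as closely.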


OPTIMUM GROUP SIZE IN PROGENY TESTING AND
FAMILY SELECTION


ALAN ROBERTSON 
Institute of Animal Genetics, Edinburgh, Scotland 


The research worker concerned with animal breeding problems is 
frequently asked the question “How many progeny do I need to test 
a sire adequately?” This is an awkward, if not impossible, question 
to answer because it depends on the meaning of the word "adequately."
The more progeny, the more accurate the assessment; the level of
accuracy that is considered adequate is a matter of subjective judgment.
But, by making the content of the question a little more practical and 
precise, it can be put in forms to which a useful solution is possible. 

In any progeny-testing programme (this implying a breeding
programme, relying on progeny-testing for improvement, rather than a
method of evaluating sires), two factors external to the actual testing 
procedure will define the programme. They are the facilities available 
(limiting the total number of progeny which can be tested in any genera- 
tion) and the number of sires that will be selected each generation. 
The latter will be limited by the rate of inbreeding which can be tolerated 
in the programme. The further variable factor in the programme is
the number of sires which will be tested each generation. This, in 
relation to the available facilities, determines the group size. The 
problem then is to determine the group size which will lead to the 
maximum expected genetic superiority of the selected sires. This is 
a balance of choice against accuracy. The more sires are tested the 
greater will be the choice, but the less accurate the information on 
which the choice is made, because the number of progeny per sire is 
less. This problem can be solved within the framework of statistical 
genetics. 


Within this framework, the expected genetic superiority of chosen
sires is then given by

$$\Delta G = i\, r_{IG}\, \sigma_G,$$

where $i$ is the selection intensity (in standard units) corresponding
to the proportion of sires selected, $r_{IG}$ is the correlation between the
progeny average $I$ and the breeding value of a sire, and $\sigma_G$ is the genetic
standard deviation in the population. If we can assume a normal
distribution of the breeding value of sires, then $i$ can be obtained from
the proportion selected from standard tables. If we know the
heritability $h^2$ of the character on which selection is based, then if there are
no non-genetic causes of differences between sire groups, we have

$$r_{IG} = \sqrt{\frac{\tfrac{1}{4} n h^2}{1 + \tfrac{1}{4}(n - 1) h^2}},$$

where $n$ is the number of progeny in the group. When we have testing
facilities for $N$ animals and a knowledge of the number of chosen
sires, $S$, required, we can easily calculate the value of $n$ which maximises
$\Delta G$. The results are illustrated in Table 1 for $N = 100$, $S = 2$, $h^2 = 0.25$.


TABLE 1


THE EXPECTED GENETIC SUPERIORITY OF SELECTED SIRES IN A PROGENY
TESTING SCHEME


(N = total progeny measured, S = sires selected, n = offspring/sire,
$\Delta G$ = genetic superiority of selected sires in genetic standard deviations).

Group size (n)           4        5       10       20
Accuracy ($r_{IG}$)      0.46     0.50     0.63     0.76

N = 100, S = 2
Sires tested            25       20       10        5
$i$                     1.74     1.64     1.27     0.83
$\Delta G$              0.80     0.82     0.80     0.62

N = 200, S = 4
Sires tested            50       40       20       10
$i$                     1.80     1.69     1.33     0.90
$\Delta G$              0.83     0.85     0.84     0.68

N = $\infty$, N/S = 50
Sires tested          12.5S      10S       5S     2.5S
$i$                     1.86     1.76     1.40     0.97
$\Delta G$              0.86     0.88     0.88     0.74


$\Delta G$ passes through a maximum when $n$ is in the region of 5. Suppose
we now turn to other values of $N$ and $S$, but keep the ratio of $N/S$
constant. The values of $i$ and $\Delta G$ are slightly altered in each case,
because for a given proportion of sires selected, $i$ increases a little as
the number selected from increases. The position of the maximum
is little changed. It seems then that with a considerable gain in
generality, we can treat the value of $N$ as very large and use as our new
variable $N/S$, the total number of progeny recorded for each group
that is eventually selected. This we shall call the testing ratio and
give it the symbol $K$. It is convenient here to introduce a further
variable, $p$, the proportion of sires selected. As the number of sires
tested each generation is $N/n$, it follows that

$$p = \frac{S}{N/n} = \frac{n}{K}, \quad \text{or} \quad n = Kp.$$
In terms of $K$ and $p$, the problem turns out to have a general solution.


To calculate $i$, we assume that we are selecting from a large sample,
so that

$$i = \frac{z}{p},$$

where $z$ is the ordinate of the normal curve at the point where the area
cut off = $p$.
Similarly,

$$r_{IG} = \sqrt{\frac{n}{n + a}} = \sqrt{\frac{p}{p + a/K}}, \quad \text{where } a = \frac{4 - h^2}{h^2}.$$

Then

$$\Delta G = \frac{z}{p} \sqrt{\frac{p}{p + a/K}}\; \sigma_G.$$

Thus the expected superiority of the chosen sires is a function of
$p$ and $K/a$. For any value of $K/a$, we are concerned with finding the
value of $p$ which maximises $\Delta G$. It follows that the optimum value
of $p$ will be a function only of $K/a$, as will the maximum value of $\Delta G$.
We have then introduced a much greater generality into the problem.
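
The dependence of $\Delta G$ on $p$ and $K/a$ can be explored numerically. The Python sketch below is an illustration under the large-sample assumption $i = z/p$ used in the text; the function names are hypothetical, and `statistics.NormalDist` from the standard library supplies the normal ordinate and quantile.

```python
import math
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def selection_intensity(p):
    """Large-sample selection intensity i = z/p for a selected proportion p."""
    x = -_STD.inv_cdf(p)  # truncation point with upper-tail area p
    return _STD.pdf(x) / p


def accuracy(n, h2):
    """Correlation between progeny average and breeding value for group size n."""
    a = (4.0 - h2) / h2
    return math.sqrt(n / (n + a))


def delta_g(p, K, h2):
    """Expected superiority of selected sires in units of sigma_G, with n = K*p."""
    a = (4.0 - h2) / h2
    return selection_intensity(p) * math.sqrt(p / (p + a / K))


if __name__ == "__main__":
    # Testing ratio K = 50 and h^2 = 0.25, as in the last block of Table 1.
    for n in (4, 5, 10, 20):
        print(n, round(accuracy(n, 0.25), 2), round(delta_g(n / 50, 50, 0.25), 2))
```

The printed values should be close to the last block of Table 1, although small differences in the second decimal place can arise because the table appears to have been built from rounded intermediate values.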

We have now to find the maximum value of $\Delta G$ or of
$(z/p)\sqrt{p/(p + a/K)}$. It is algebraically much simpler to find the minimum
of the square of the reciprocal of this, $[p(p + a/K)]/z^2$. In this, we
must remember that $dz/dp = x$, the abscissa of the normal curve at
the point defined by $p$.

$$\frac{d}{dp}\left[\frac{p(p + a/K)}{z^2}\right] = \frac{1}{z^2}\left(2p + \frac{a}{K}\right) - \frac{2x}{z^3}\, p\left(p + \frac{a}{K}\right) = \frac{1}{z^3}\left[ z\left(2p + \frac{a}{K}\right) - 2xp\left(p + \frac{a}{K}\right) \right] = 0, \text{ at the minimum.}$$

This simplifies to

$$\frac{K}{a} = \frac{1}{2p} \cdot \frac{2px - z}{z - px}.$$

We have thus equated $K/a$ to a function of $p$, easily calculated from
tables of the normal distribution. We can thus determine the optimum
value of $p$ in terms of $K/a$, and therefore also the maximum value of $\Delta G$.

These are shown in Figures 1 and 2, in which $K/a$ is plotted on a
log scale. Figure 1 shows the optimum value of $p$ and Figure 2 the
maximum value of $\Delta G$. Several interesting points emerge from a study
of these. The first is that $p$ increases as $K/a$ declines but approaches
asymptotically the value of 0.27 (where $2px = z$). Thus, for the
optimum running of any such scheme, the intensity of selection between
tested sires must be at least one in four. At values of $K/a$ greater than
3, it is found empirically that the curve approximates closely to
$p = 0.28\sqrt{a/K}$. Substituting $n = Kp$, and noting that $a = (4 - h^2)/h^2$ is
close to $4/h^2$ when $h^2$ is small, this leads to the expression
$n = 0.56\sqrt{K/h^2}$ as the optimum value for group size.
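
The optimum proportion can also be obtained directly by solving the condition $K/a = (1/2p)(2px - z)/(z - px)$ for $p$. The Python sketch below does this by bisection; it is an illustration only, the function names are hypothetical, and it assumes that the right-hand side changes sign exactly once on the search interval.

```python
import math
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def k_over_a(p):
    """Right-hand side of the optimum condition, K/a as a function of p."""
    x = -_STD.inv_cdf(p)  # abscissa cutting off upper-tail area p
    z = _STD.pdf(x)       # ordinate at that abscissa
    return (2.0 * p * x - z) / (2.0 * p * (z - p * x))


def optimum_p(ratio, lo=1e-9, hi=0.28, iters=200):
    """Solve k_over_a(p) = ratio (ratio = K/a > 0) for p by bisection.
    k_over_a is large for small p and negative at p = 0.28."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if k_over_a(mid) > ratio:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for ratio in (0.01, 1.0, 3.0, 10.0, 100.0):
        exact = optimum_p(ratio)
        approx = 0.28 * math.sqrt(1.0 / ratio)  # empirical rule for K/a > 3
        print(ratio, round(exact, 4), round(approx, 4))
```

Comparing the two printed columns shows where the empirical rule $p = 0.28\sqrt{a/K}$ is adequate and where it breaks down for small $K/a$.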


FIGURE 1
THE OPTIMUM PROPORTION OF SIRES SELECTED ($p$) AS A FUNCTION OF $Kh^2/(4 - h^2)$


As might be expected, $\Delta G_{\max}$ increases as $K/a$ increases. Above a
value of $K/a$ of about 5, it becomes a straight line in the plot against log
$K/a$. This fact, which has thus far defied algebraic check, means that
a given proportionate rise in $K/a$ will produce the same arithmetic
increase in the superiority of chosen sires.


FIGURE 2
THE EXPECTED GENETIC SUPERIORITY OF SELECTED SIRES (IN TERMS OF $\sigma_G$ IN THE
POPULATION) AS A FUNCTION OF $Kh^2/(4 - h^2)$


Tolerable Limits of Group Size


We have so far determined the group size which will give maximum
superiority of chosen sires. It is useful to inquire further into the
limits of variation in group size which can be tolerated before the
genetic superiority falls off markedly. This can only be done empirically
by drawing out curves of $\Delta G$ against $n$ for different values of $K$ and
of $h^2$. Figure 3 shows the curves for $K = 100$ for different values of
$h^2$. For simplicity, they have been drawn as continuous functions.
They all must intersect the $\Delta G$ axis at $n = 0$ and $n = K$ (the latter
implying $p = 1$, no selection). This imposes limitations upon the shape
of the curves and on the optimum value of $n$. Thus in this case over the
range of heritabilities from 0.01 to 0.25, $\Delta G$ will be within 10 percent
of its maximum over the range of $n$ from 10 to 26. For the comparison
of the algebraic results with the actual curves, the three curves
correspond to the points A, B, and C in Figures 1 and 2.


FIGURE 3
THE VARIATION OF GENETIC SUPERIORITY OF SELECTED SIRES WITH GROUP SIZE
($K = 100$, $h^2$ VARIABLE)


FIGURE 4
THE VARIATION OF GENETIC SUPERIORITY OF SELECTED SIRES WITH GROUP SIZE
($h^2 = 0.25$, $K$ VARIABLE)


Figure 4 shows the set of curves for $h^2 = 0.25$ and different values
of $K$. $\Delta G$ will be zero for all when $n = 0$, but the separate curves will
again be zero at different points (when $n = K$, in fact).

There is thus not the same restraint in the position of the maximum.
For $h^2 = 0.25$, there is a short range, in the neighborhood of $n = 12$,
for which $\Delta G$ is within 10 percent of its maximum value over these
ranges of $K$. But for lower values of $h^2$, these tolerable ranges of $n$ no
longer exist.

TABLE 2


THE RANGE OF GROUP SIZE GIVING GENETIC IMPROVEMENT WITHIN 10
PERCENT OF THE MAXIMUM FOR A HERITABILITY RANGE 0.01-0.25.


Testing ratio K      Group size
       50               6-11
      100              10-26
      200              20-40
     1000              60-90

Rather surprisingly, it seems that tolerable values of $n$ depend
much more on the testing ratio $K$ than on $h^2$. In fact, useful ranges of
$n$ can be given in terms of $K$ which will give genetic superiority of
chosen sires within 10 percent of the maximum value over the $h^2$ range
0.01-0.25. Table 2 gives these ranges for certain values of $K$. The
preferred values within the range will increase as $h^2$ decreases.


The Effect of Non-Genetic Differences between Groups 


The case which we have so far discussed assumed that the variance 
component between progeny groups was entirely genetic in origin. In 
many practical instances, however, this condition is not satisfied and 
there are environmental effects common to all members of a group. 
This reduces the effectiveness of selection as it introduces a non-genetic 
element into the between-group variance which cannot be reduced by an 
increase in group size. Formally we may describe this as a proportion
($c^2$) of the total variance, which is common to all members of a group.
We have then the expression for $r_{IG}$:

$$r_{IG} = \sqrt{\frac{\tfrac{1}{4} n h^2}{1 + (n - 1)(\tfrac{1}{4} h^2 + c^2)}} = \sqrt{\frac{h^2}{h^2 + 4c^2}}\; \sqrt{\frac{\tfrac{1}{4} n (h^2 + 4c^2)}{1 + \tfrac{1}{4}(n - 1)(h^2 + 4c^2)}}.$$


We may then generalise the presentation given above to include
this case by writing

$$a' = \frac{4 - h^2 - 4c^2}{h^2 + 4c^2}$$

in the determination of optimum structure and by multiplying the
expected improvement by the factor $\sqrt{h^2/(h^2 + 4c^2)}$. As $a'$ will be less
than $a$, it follows that for a given $K$ the value of $p$ at the optimum will
be less than in the simpler case so that the optimum group size will be
decreased and the number of sires tested increased. The generalizations
we derived earlier about the minimum intensity of selection of 1 in 4
for such a programme will therefore still hold. The tolerable limits
of group size will still hold, except that the critical range they refer to
will be of $h^2 + 4c^2$ instead of $h^2$.

Examination of the expression for $r_{IG}$ shows that it falls
conceptually into two parts, if we introduce the idea of the breeding value
of a sire under a repeatable but not measurable environmental bias.
The first term, $\sqrt{h^2/(h^2 + 4c^2)}$, is then the correlation between the true
breeding value and the actual value under that bias, and the second,

$$\sqrt{\frac{\tfrac{1}{4} n (h^2 + 4c^2)}{1 + \tfrac{1}{4}(n - 1)(h^2 + 4c^2)}},$$

is the correlation between the biased value and its estimate, which has
been subject to further error by sampling within the group.
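
The factorisation of $r_{IG}$ into a bias term and a sampling term can be checked numerically. The Python sketch below is illustrative only; the function names and the trial values of $n$, $h^2$ and $c^2$ are assumptions chosen for the check.

```python
import math


def accuracy_direct(n, h2, c2):
    """r_IG when a proportion c2 of the total variance is common to a group."""
    return math.sqrt(0.25 * n * h2 / (1.0 + (n - 1) * (0.25 * h2 + c2)))


def accuracy_factors(n, h2, c2):
    """The two factors of r_IG: (bias term, sampling term)."""
    total = h2 + 4.0 * c2
    bias_term = math.sqrt(h2 / total)
    sampling_term = math.sqrt(0.25 * n * total / (1.0 + 0.25 * (n - 1) * total))
    return bias_term, sampling_term


def a_prime(h2, c2):
    """Generalised constant a' replacing a = (4 - h2)/h2."""
    return (4.0 - h2 - 4.0 * c2) / (h2 + 4.0 * c2)


if __name__ == "__main__":
    n, h2, c2 = 10, 0.25, 0.05  # trial values, chosen only for illustration
    bias, sampling = accuracy_factors(n, h2, c2)
    print(accuracy_direct(n, h2, c2), bias * sampling)
    print(sampling, math.sqrt(n / (n + a_prime(h2, c2))))
```

Each printed pair should agree, which confirms that the sampling term has the same form $\sqrt{n/(n + a')}$ as in the simpler case and that only the constant changes.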


Discussion 


The treatment so far presented has been in terms of a progeny- 
testing programme. Exactly the same formulae will of course hold in 
half-sib family selection. In full-sib selection, the formula will have
to be slightly modified. We should then write $a = (2 - h^2)/h^2$, rather
than $(4 - h^2)/h^2$ in the half-sib case. We have presumed that the family
size is variable at will. This of course is not so and the logical limitations 
in this are in some cases critical. Full-sib selection in cattle is obviously 
absurd and in poultry, full-sib family size is limited by the need to 
have only a limited hatching period. These factors, rather than the 
considerations presented here, will then be of overriding importance. 

It may be of interest to illustrate this approach with two practical
examples. The writer is associated with a progeny-testing scheme in
dairy cattle at an artificial insemination centre. The essential fact
of the scheme is the breeding of new bulls for testing from the best
available proven sires. In each generation cycle, two sires ($S = 2$) will
be selected on the basis of a total tested sample of 300 heifers ($N = 300$)
so that $K = 150$. If we assume $h^2 = 0.25$ ($a = 15$) we find a $K/a$ value
of 10. Referring to Figures 1 and 2, we find that at the optimum,
$p = 0.10$ and $\Delta G = 1.2\sigma_G$. Thus we should test 20 bulls on 15 daughters
each; the actual intention is to test 12 on 25 daughters, because of the
cost of buying and keeping too many sires unemployed. As $\sigma_G$ is in the
neighborhood of 80 gallons, the selected sires should show a genetic
superiority of some 100 gallons, which in their daughters would be seen
as a production superiority over their herd mates averaging 50 gallons.

Secondly, let us consider a flock of 500 hens to be improved by 
half-sib family selection, in which we take 5 selected sires as the lower 
limit possible. This gives K = 100 and, if we assume a heritability 
of egg production of 0.05, we have a K/a value of 1.25. At the optimum, 
this gives p = 0.19. We should therefore test 26 sires a year on 19 
daughters each. 
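
The arithmetic of the two examples can be retraced with a short sketch. The function name `design` and its arguments are hypothetical; the optimum proportion selected, p, is not computed here but taken as an input, since in the text it is read from Figures 1 and 2.

```python
def design(total_tested, sires_selected, h2, p_opt, full_sib=False):
    """Retrace the testing-ratio arithmetic of the two examples.

    p_opt is an assumed input: the optimum proportion selected,
    which the text reads from its Figures 1 and 2.
    """
    K = total_tested / sires_selected            # testing ratio
    a = ((2 if full_sib else 4) - h2) / h2       # (4 - h^2)/h^2 for half-sibs
    groups = sires_selected / p_opt              # number of sires to test
    group_size = total_tested / groups           # daughters per sire
    return {"K": K, "a": a, "K_over_a": K / a,
            "groups": groups, "group_size": group_size}


dairy = design(total_tested=300, sires_selected=2, h2=0.25, p_opt=0.10)
poultry = design(total_tested=500, sires_selected=5, h2=0.05, p_opt=0.19)

for name, d in (("dairy", dairy), ("poultry", poultry)):
    print(name, {k: round(v, 2) for k, v in d.items()})
```

For the dairy scheme this gives 20 bulls on 15 daughters each; for the poultry flock it gives about 26 sires on 19 daughters each, as stated above.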


Summary 
A general procedure is presented for the problem of optimum group 


size in progeny-testing and family selection programmes. This involves 
the testing ratio, K, the ratio of the number of individuals which can 


450 BIOMETRICS, DECEMBER 1957 


be measured each generation to the number of sires (or family groups) 
that are to be selected. The optimum structure of the population and 
the probable genetic superiority of chosen groups are then dependent 
only on K, the heritability, $h^2$, and the genetic relationship within 
groups. The treatment has also been extended to cover problems 
arising from non-genetic differences between groups. 


PARTITION OF EXPERIMENTAL VECTORS CONNECTED 
WITH MULTINOMIAL DISTRIBUTIONS 
L. C. A. Corsten 


Institute for Research on Varieties of Field Crops (I.V.R.O.) 
Wageningen, The Netherlands 


1. Introduction 


The investigation of variates in contingency tables (which need 
not be orthogonal) often gives rise to a partition of the experimental 
result in components each illuminating one aspect of the problem in 
question. This is here expressed in terms of vectors. Compare Kuiper 
[9]. 

Our presentation is perhaps more transparent than previous papers 
dealing with similar subjects: Fisher [3], Fog [4], Irwin [6], Lancaster 
[10], [12], [13]. Moreover our method is general; hence it can be applied 
also to more intricate cases. Our results can be applied in genetics, 
which in fact motivated our research. This is why we give an intro- 
ductory section (Sec. 2) on genetics. 


2. Genetics 

We consider externally perceptible properties of individuals (plants, 
animals), like the colour (green or yellow) of seeds, or of eyes (brown or 
blue) of men. Each property to be considered is determined by genes 
of only one locus, dominant A and recessive a, such that the only 
possible distinction with respect to that property is dominant (AA or 
Aa) or recessive (aa). 

We assume that the choice of a paternal gamete by a zygote is 
stochastically independent of the choice of a maternal gamete. More- 
over we assume equal viability for every combination of gametes. 

Then it follows that crossing individuals Aa with each other will 
produce zygotes having the dominant and the recessive form of the 
property with probabilities $\tfrac{3}{4}$ and $\tfrac{1}{4}$ respectively. 

We consider a second property and locus with genes B and b. If 
this locus is on a chromosome different from that carrying A and a, 
the dominance or recessivity will be independent for the two considered 
properties. 

Crossing such individuals Aa/Bb with each other yields zygotes 
with probabilities as mentioned in the following table: 

$$
\begin{array}{c|cc}
  & A & a \\ \hline
B & \tfrac{9}{16} & \tfrac{3}{16} \\
b & \tfrac{3}{16} & \tfrac{1}{16}
\end{array}
\tag{1}
$$

where the capitals and lower case denote the dominant and recessive 
phenotypes respectively. The column and row totals represent the 
chances of the dominant and the recessive forms of the first and the 
second property respectively. 

If both loci occur on the same chromosome and crossing-over does 
not take place, the gametes of an individual Aa/Bb are only AB and 
ab with equal probabilities. Crossing such individuals with each other 
produces zygotes for which the following table of probabilities appears: 


$$
\begin{array}{c|cc}
  & A & a \\ \hline
B & \tfrac{3}{4} & 0 \\
b & 0 & \tfrac{1}{4}
\end{array}
\tag{2}
$$


In a similar way crossing individuals Aa/bB with gametes Ab and aB 
yields: 


$$
\begin{array}{c|cc}
  & A & a \\ \hline
B & \tfrac{1}{2} & \tfrac{1}{4} \\
b & \tfrac{1}{4} & 0
\end{array}
\tag{2'}
$$


In both (2) and (2’) the column and row totals are the same as in (1) 
and they have the same meaning. 

If both loci occur on the same chromosome and crossing-over takes 
place, the gametes of an individual Aa/Bb (in the coupling phase) are 
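AB, ab, aB and Ab with probabilities, say, $\tfrac12 - \tfrac12 W$, $\tfrac12 - \tfrac12 W$, $\tfrac12 W$, $\tfrac12 W$ 
respectively (with $0 < W < \tfrac12$). Crossing such individuals with each 
other yields zygotes to which the following table of probabilities 
corresponds: 

$$
\begin{array}{c|cc}
  & A & a \\ \hline
B & \tfrac14(W^2 - 2W + 3) & \tfrac14(-W^2 + 2W) \\
b & \tfrac14(-W^2 + 2W) & \tfrac14(W^2 - 2W + 1)
\end{array}
\tag{3}
$$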


In a similar way individuals Aa/bB (in the repulsion phase) where 
crossing-over takes place will have gametes AB, ab, aB, and Ab with 
probabilities $\tfrac12 W$, $\tfrac12 W$, $\tfrac12 - \tfrac12 W$, and $\tfrac12 - \tfrac12 W$ respectively. The 
following table of probabilities for zygotes yielded by such gametes 
corresponds to this situation: 

$$
\begin{array}{c|cc}
  & A & a \\ \hline
B & \tfrac14(W^2 + 2) & \tfrac14(1 - W^2) \\
b & \tfrac14(1 - W^2) & \tfrac14 W^2
\end{array}
\tag{3'}
$$

In both cases (3) and (3'), the column and row totals are the same as 
in (1), (2), and (2'). If W = 0 (crossing-over does not occur), (3) 
reduces to (2), and (3') reduces to (2'): complete linkage. If $W = \tfrac12$ 
both (3) and (3') reduce to (1): the two properties are completely 
independent. 

Increasing linkage (i.e. decreasing W) appears with respect to 
tables like (1) as an increase of the numbers in one diagonal and a 
decrease of the numbers in the other diagonal, row and column totals 
remaining constant; the contribution from linkage to each cell of the 
table has the same absolute value. 
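
Tables (1) to (3') can be checked by enumerating pairs of gametes directly. The following is a minimal sketch using exact fractions; the function name `phenotype_table` and the dictionary layout (keys are row, column) are illustrative assumptions, not notation from the paper.

```python
from fractions import Fraction as F


def phenotype_table(W, phase="coupling"):
    """Phenotype probabilities from selfing a double heterozygote.

    W is the crossing-over parameter of the text (0 <= W <= 1/2).
    Returns a dict keyed by (row, column) = (B or b, A or a).
    """
    half = F(1, 2)
    rare, common = half * W, half - half * W
    if phase == "coupling":      # Aa/Bb: AB and ab are the parental gametes
        gametes = {"AB": common, "ab": common, "aB": rare, "Ab": rare}
    else:                        # repulsion, Aa/bB: Ab and aB are parental
        gametes = {"AB": rare, "ab": rare, "aB": common, "Ab": common}

    table = {(r, c): F(0) for r in "Bb" for c in "Aa"}
    for g1, pr1 in gametes.items():
        for g2, pr2 in gametes.items():
            zygote = g1 + g2
            col = "A" if "A" in zygote else "a"   # dominant if any A present
            row = "B" if "B" in zygote else "b"
            table[(row, col)] += pr1 * pr2
    return table


independent = phenotype_table(F(1, 2))            # should equal table (1)
complete_coupling = phenotype_table(F(0))         # should equal table (2)
complete_repulsion = phenotype_table(F(0), "repulsion")  # table (2')
print(independent)
```

Because the enumeration uses only the stated gamete probabilities, agreement with the closed forms in (3) and (3') is a check on those formulas rather than a restatement of them.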


3. Definition of components of a two by two table 


We consider a scheme of probabilities like (1), (a scheme with 
probabilities $p_1$ and $q_1$ for rows and $p_2$ and $q_2$ for columns and 
independence of these probabilities). This scheme (vector) is represented 
by 

$$
\begin{bmatrix} p_1p_2 & p_1q_2 \\ q_1p_2 & q_1q_2 \end{bmatrix}
$$

and it is called the basis of the one-dimensional space of levels. 
It is our purpose to compare a scheme of counts arranged in a two 
by two table: 

$$
x = \begin{bmatrix} x_1 & x_2 \\ x_3 & x_4 \end{bmatrix}, \qquad \text{where } \sum_{i=1}^{4} x_i = n,
$$

or rather of the relative frequencies $x_i/n$, with such a level. The 
experimental result may suggest that the independence is satisfied, that 
also the proportion $p_2 : q_2$ is not a bad description, but that the 
proportion $p_1 : q_1$ is wrong. 
Then the following scheme may be more likely: 

$$
\begin{bmatrix} (p_1 + \alpha)p_2 & (p_1 + \alpha)q_2 \\ (q_1 - \alpha)p_2 & (q_1 - \alpha)q_2 \end{bmatrix}
$$

wherein $\alpha$ is a suitably chosen real number. Also in this scheme the 
probabilities for the columns are $p_2$ and $q_2$ respectively and there is 
independence between row and column probabilities. The scheme is 
equal to 

$$
\begin{bmatrix} p_1p_2 & p_1q_2 \\ q_1p_2 & q_1q_2 \end{bmatrix}
+ \alpha \begin{bmatrix} p_2 & q_2 \\ -p_2 & -q_2 \end{bmatrix}.
$$

We call 

$$
\alpha \begin{bmatrix} p_2 & q_2 \\ -p_2 & -q_2 \end{bmatrix}
$$

a row effect and 

$$
\begin{bmatrix} p_2 & q_2 \\ -p_2 & -q_2 \end{bmatrix}
$$

the basis of the one-dimensional space of row effects. 

Analogously, 

$$
\begin{bmatrix} p_1 & -p_1 \\ q_1 & -q_1 \end{bmatrix}
$$

is called the basis of the one-dimensional space of column effects. 
It may also happen that the independence is satisfied, but that 
both proportions $p_1 : q_1$ and $p_2 : q_2$ are wrong, and that at first instance 
the following scheme seems to be more plausible: 

$$
\begin{bmatrix} (p_1 + \alpha)(p_2 + \beta) & (p_1 + \alpha)(q_2 - \beta) \\ (q_1 - \alpha)(p_2 + \beta) & (q_1 - \alpha)(q_2 - \beta) \end{bmatrix}
\tag{4}
$$

The chances for the rows are $(p_1 + \alpha)$ and $(q_1 - \alpha)$ respectively, 
those for columns $(p_2 + \beta)$ and $(q_2 - \beta)$ respectively, the independence 
being maintained. But if, in accordance with the customary practice 
in the analysis of variance, we postulate additivity of row and column 
effects, the following vector appears: 

$$
\begin{bmatrix} p_1p_2 & p_1q_2 \\ q_1p_2 & q_1q_2 \end{bmatrix}
+ \alpha \begin{bmatrix} p_2 & q_2 \\ -p_2 & -q_2 \end{bmatrix}
+ \beta \begin{bmatrix} p_1 & -p_1 \\ q_1 & -q_1 \end{bmatrix}
= \begin{bmatrix} p_1p_2 + \alpha p_2 + \beta p_1 & p_1q_2 + \alpha q_2 - \beta p_1 \\ q_1p_2 - \alpha p_2 + \beta q_1 & q_1q_2 - \alpha q_2 - \beta q_1 \end{bmatrix}
\tag{5}
$$

In this scheme row and column totals are as desired, but the vector 
(4) is equal to (5) plus 

$$
\begin{bmatrix} \alpha\beta & -\alpha\beta \\ -\alpha\beta & \alpha\beta \end{bmatrix}
= \alpha\beta \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}.
$$
The remark in Sec. 2 (last paragraph) may be expressed as follows: 
Table (3) can be written as the sum of (1) and 

$$
\frac{4W^2 - 8W + 3}{16} \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}
$$

and Table (3') as the sum of (1) and 

$$
\frac{4W^2 - 1}{16} \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}.
$$

So it is plausible to consider a vector 

$$
\gamma \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}
$$

as representative of linkage, or of disturbance of independence, or, in 
terms of the analysis of variance, as interaction. Therefore we call the 
vector 

$$
\begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}
$$

the basis of the one-dimensional space of interactions. We remark that 
in contrast with row and column effects, this basis does not depend on 
the form of the level. 

In (4) and (5) we saw that, if both row and column effects are 
present, and moreover additive, a disturbance of independence with 
respect to the scheme (4) considered as level occurs. The disturbance is 

$$
-\alpha\beta \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}.
$$

If $\alpha$ and $\beta$ are small, this interaction is very small and even absent if 
$\alpha$ and/or $\beta$ are zero. Then this interaction is negligible in comparison 
with possible interaction from linkage. 

With these remarks in mind it will be our problem to determine the 
coefficients in the following partition: 

$$
\begin{bmatrix} x_1/n & x_2/n \\ x_3/n & x_4/n \end{bmatrix}
= \mu \begin{bmatrix} p_1p_2 & p_1q_2 \\ q_1p_2 & q_1q_2 \end{bmatrix}
+ \alpha \begin{bmatrix} p_2 & q_2 \\ -p_2 & -q_2 \end{bmatrix}
+ \beta \begin{bmatrix} p_1 & -p_1 \\ q_1 & -q_1 \end{bmatrix}
+ \gamma \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}
\tag{6}
$$

where $\mu$, $\alpha$, $\beta$, and $\gamma$ are unknown. 


4. The Partition 


The conventional solution of $\mu$, $\alpha$, $\beta$, and $\gamma$ is not simple, because 
the basis vectors on the righthand side of (6) are in general 
non-orthogonal. Orthogonality, i.e. ordinary inner products being zero, implies 
that the components can be obtained from orthogonal projections. 
Orthogonality can be obtained with a suitable choice of the following 
equivalent partition: 

$$
\begin{bmatrix} \dfrac{x_1 A}{n} & \dfrac{x_2 B}{n} \\ \dfrac{x_3 C}{n} & \dfrac{x_4 D}{n} \end{bmatrix}
= \mu \begin{bmatrix} A p_1p_2 & B p_1q_2 \\ C q_1p_2 & D q_1q_2 \end{bmatrix}
+ \alpha \begin{bmatrix} A p_2 & B q_2 \\ -C p_2 & -D q_2 \end{bmatrix}
+ \beta \begin{bmatrix} A p_1 & -B p_1 \\ C q_1 & -D q_1 \end{bmatrix}
+ \gamma \begin{bmatrix} A & -B \\ -C & D \end{bmatrix}
\tag{7}
$$

The four components are orthogonal, if the 6 inner products, of 
which two turn out to be identical, vanish: 

$$
\begin{aligned}
A^2 p_1p_2^2 + B^2 p_1q_2^2 - C^2 q_1p_2^2 - D^2 q_1q_2^2 &= 0 \\
A^2 p_1^2p_2 - B^2 p_1^2q_2 + C^2 q_1^2p_2 - D^2 q_1^2q_2 &= 0 \\
A^2 p_1p_2 - B^2 p_1q_2 - C^2 q_1p_2 + D^2 q_1q_2 &= 0 \\
A^2 p_2 - B^2 q_2 + C^2 p_2 - D^2 q_2 &= 0 \\
A^2 p_1 + B^2 p_1 - C^2 q_1 - D^2 q_1 &= 0.
\end{aligned}
$$

The third equation minus $p_1$ times the fourth equation, and the third 
equation minus $p_2$ times the fifth equation, yield $C^2 = (q_2/p_2)D^2$ and 
$B^2 = (q_1/p_1)D^2$ respectively. Substitution in the third equation 
yields: $A^2 = (q_1q_2/p_1p_2)D^2$. If we choose $D^2 = 1/q_1q_2$, we obtain: 

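$$
A = \frac{1}{\sqrt{p_1p_2}}, \quad B = \frac{1}{\sqrt{p_1q_2}}, \quad C = \frac{1}{\sqrt{q_1p_2}}, \quad D = \frac{1}{\sqrt{q_1q_2}},
$$

and these values happen to satisfy all equations. 
Substituting the values for A, B, C, D in (7) we get: 

$$
\begin{bmatrix} \dfrac{x_1}{n\sqrt{p_1p_2}} & \dfrac{x_2}{n\sqrt{p_1q_2}} \\ \dfrac{x_3}{n\sqrt{q_1p_2}} & \dfrac{x_4}{n\sqrt{q_1q_2}} \end{bmatrix}
= \mu \begin{bmatrix} \sqrt{p_1p_2} & \sqrt{p_1q_2} \\ \sqrt{q_1p_2} & \sqrt{q_1q_2} \end{bmatrix}
+ \alpha \begin{bmatrix} \sqrt{p_2/p_1} & \sqrt{q_2/p_1} \\ -\sqrt{p_2/q_1} & -\sqrt{q_2/q_1} \end{bmatrix}
+ \beta \begin{bmatrix} \sqrt{p_1/p_2} & -\sqrt{p_1/q_2} \\ \sqrt{q_1/p_2} & -\sqrt{q_1/q_2} \end{bmatrix}
+ \gamma \begin{bmatrix} \dfrac{1}{\sqrt{p_1p_2}} & -\dfrac{1}{\sqrt{p_1q_2}} \\ -\dfrac{1}{\sqrt{q_1p_2}} & \dfrac{1}{\sqrt{q_1q_2}} \end{bmatrix}
$$

or putting: $p_1p_2 = \pi_1$, $p_1q_2 = \pi_2$, $q_1p_2 = \pi_3$, and $q_1q_2 = \pi_4$, we obtain: 

$$
\begin{bmatrix} \dfrac{x_1}{n\sqrt{\pi_1}} & \dfrac{x_2}{n\sqrt{\pi_2}} \\ \dfrac{x_3}{n\sqrt{\pi_3}} & \dfrac{x_4}{n\sqrt{\pi_4}} \end{bmatrix}
= \mu \begin{bmatrix} \sqrt{\pi_1} & \sqrt{\pi_2} \\ \sqrt{\pi_3} & \sqrt{\pi_4} \end{bmatrix}
+ \frac{\alpha}{\sqrt{p_1q_1}} \begin{bmatrix} \sqrt{\pi_3} & \sqrt{\pi_4} \\ -\sqrt{\pi_1} & -\sqrt{\pi_2} \end{bmatrix}
+ \frac{\beta}{\sqrt{p_2q_2}} \begin{bmatrix} \sqrt{\pi_2} & -\sqrt{\pi_1} \\ \sqrt{\pi_4} & -\sqrt{\pi_3} \end{bmatrix}
+ \frac{\gamma}{\sqrt{p_1q_1p_2q_2}} \begin{bmatrix} \sqrt{\pi_4} & -\sqrt{\pi_3} \\ -\sqrt{\pi_2} & \sqrt{\pi_1} \end{bmatrix}
\tag{8}
$$

The four new vectors (in brackets) are perpendicular to each other, 
indeed ($\pi_1\pi_4 = \pi_2\pi_3$). They are moreover unit vectors. The required 
coefficients are thus simply the inner products of the vector [on the left 
hand side of (8)] to be projected, and the vector on which it is projected. 
For example, $\mu$ is obtained by taking the inner products of both sides of 
(8) with the unit vector of which $\mu$ is the coefficient. In this way we 
obtain: 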


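$$
\mu = \frac{1}{n}(x_1 + x_2 + x_3 + x_4) = 1,
$$

$$
\frac{\alpha}{\sqrt{p_1q_1}}
= \frac{1}{n}\left[ x_1\sqrt{\frac{\pi_3}{\pi_1}} + x_2\sqrt{\frac{\pi_4}{\pi_2}} - x_3\sqrt{\frac{\pi_1}{\pi_3}} - x_4\sqrt{\frac{\pi_2}{\pi_4}} \right]
= \frac{1}{n\sqrt{p_1q_1}}\left[ (x_1 + x_2)q_1 - (x_3 + x_4)p_1 \right],
$$

or 

$$
\alpha = \frac{(x_1 + x_2)q_1 - (x_3 + x_4)p_1}{n}.
$$

In a similar way: 

$$
\beta = \frac{(x_1 + x_3)q_2 - (x_2 + x_4)p_2}{n}
$$

and 

$$
\gamma = \frac{x_1q_1q_2 - x_2q_1p_2 - x_3p_1q_2 + x_4p_1p_2}{n}.
$$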

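Substituting these values in (8), subtracting the first term on the right 
from both sides and multiplying both sides by $\sqrt{n}$, we obtain: 

$$
\begin{bmatrix} \dfrac{x_1 - n\pi_1}{\sqrt{n\pi_1}} & \dfrac{x_2 - n\pi_2}{\sqrt{n\pi_2}} \\ \dfrac{x_3 - n\pi_3}{\sqrt{n\pi_3}} & \dfrac{x_4 - n\pi_4}{\sqrt{n\pi_4}} \end{bmatrix}
= \frac{(x_1 + x_2)q_1 - (x_3 + x_4)p_1}{\sqrt{np_1q_1}} \begin{bmatrix} \sqrt{\pi_3} & \sqrt{\pi_4} \\ -\sqrt{\pi_1} & -\sqrt{\pi_2} \end{bmatrix}
+ \frac{(x_1 + x_3)q_2 - (x_2 + x_4)p_2}{\sqrt{np_2q_2}} \begin{bmatrix} \sqrt{\pi_2} & -\sqrt{\pi_1} \\ \sqrt{\pi_4} & -\sqrt{\pi_3} \end{bmatrix}
+ \frac{x_1q_1q_2 - x_2q_1p_2 - x_3p_1q_2 + x_4p_1p_2}{\sqrt{np_1q_1p_2q_2}} \begin{bmatrix} \sqrt{\pi_4} & -\sqrt{\pi_3} \\ -\sqrt{\pi_2} & \sqrt{\pi_1} \end{bmatrix}
\tag{9}
$$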
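
The partition of the standardized deviations into three perpendicular components can be verified numerically. The sketch below is illustrative only: the function name `partition_2x2` and the counts in the usage lines are hypothetical, and the cells are taken in the order $x_1, x_2, x_3, x_4$ used above.

```python
import math


def partition_2x2(x, p1, p2):
    """Split the standardized deviations of a 2 x 2 table into
    row, column and interaction components.

    x  : counts (x1, x2, x3, x4), read row by row
    p1 : hypothetical probability of row 1; p2 : of column 1
    """
    x1, x2, x3, x4 = x
    n = sum(x)
    q1, q2 = 1 - p1, 1 - p2
    pi = [p1 * p2, p1 * q2, q1 * p2, q1 * q2]
    s = [math.sqrt(v) for v in pi]

    # Left-hand side: (x_i - n*pi_i) / sqrt(n*pi_i)
    X = [(xi - n * v) / math.sqrt(n * v) for xi, v in zip(x, pi)]

    # Coefficients of the three unit vectors
    coef = {
        "row": ((x1 + x2) * q1 - (x3 + x4) * p1) / math.sqrt(n * p1 * q1),
        "column": ((x1 + x3) * q2 - (x2 + x4) * p2) / math.sqrt(n * p2 * q2),
        "interaction": (x1 * q1 * q2 - x2 * q1 * p2 - x3 * p1 * q2
                        + x4 * p1 * p2) / math.sqrt(n * p1 * q1 * p2 * q2),
    }
    # Orthogonal unit vectors, written in the same cell order
    unit = {
        "row": [s[2], s[3], -s[0], -s[1]],
        "column": [s[1], -s[0], s[3], -s[2]],
        "interaction": [s[3], -s[2], -s[1], s[0]],
    }
    return X, coef, unit


# Hypothetical counts and probabilities, chosen only for illustration
X, coef, unit = partition_2x2((30, 20, 25, 25), p1=0.4, p2=0.6)
rebuilt = [sum(coef[k] * unit[k][i] for k in coef) for i in range(4)]
print(X)
print(rebuilt)
```

The two printed vectors should agree up to rounding, and the sum of the squared coefficients should equal the sum of the squared entries of the left-hand vector.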
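5. Statistical considerations 

x has a multinomial probability distribution: 

$$
P(x) = \frac{n!}{x_1!\,x_2! \cdots x_k!}\, \pi_1^{x_1}\pi_2^{x_2} \cdots \pi_k^{x_k}, \qquad i = 1, 2, \ldots, k,
$$

where 

$$
\sum_{i=1}^{k} \pi_i = 1 \quad \text{and} \quad \sum_{i=1}^{k} x_i = n,
$$

or 

$$
P(x) = \frac{\dfrac{(n\pi_1)^{x_1}e^{-n\pi_1}}{x_1!} \cdot \dfrac{(n\pi_2)^{x_2}e^{-n\pi_2}}{x_2!} \cdots \dfrac{(n\pi_k)^{x_k}e^{-n\pi_k}}{x_k!}}{\dfrac{n^n e^{-n}}{n!}}
\tag{10}
$$
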
In Sec. 4 we had the case k = 4 and $E(x_i) = n\pi_i$. 

The same probability (10) will be obtained if we formally assert that 
the $x_i$ have independent Poisson distributions with $\lambda_i = n\pi_i$ and 
probabilities $\lambda_i^{x_i}e^{-\lambda_i}/x_i!$ under the condition that their sum is equal 
to $\sum x_i$; for the sum of Poisson distributed variables has also a Poisson 
distribution with parameter $\sum \lambda_i$ (in this case equal to $\sum n\pi_i = n$), 
and its probability occurs in the denominator of P(x), as is proper in 
the case of a conditional probability. 
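
The equality of the multinomial probability and the conditional Poisson form (10) can be confirmed for any particular table. The following sketch does so with the standard library only; the function names and the example table are hypothetical.

```python
import math


def multinomial_pmf(x, pi):
    """n!/(x1!...xk!) * prod(pi_i ** x_i)."""
    n = sum(x)
    coef = math.factorial(n)
    for xi in x:
        coef //= math.factorial(xi)   # stays an integer at every step
    return coef * math.prod(p ** xi for p, xi in zip(pi, x))


def poisson_pmf(k, lam):
    return lam ** k * math.exp(-lam) / math.factorial(k)


def conditional_poisson_pmf(x, pi):
    """Independent Poisson counts with means n*pi_i, conditioned on
    their sum being n (the sum itself is Poisson with mean n)."""
    n = sum(x)
    joint = math.prod(poisson_pmf(xi, n * p) for xi, p in zip(x, pi))
    return joint / poisson_pmf(n, n)


# Hypothetical table with k = 4 cells
x = (3, 2, 4, 1)
pi = (0.1, 0.2, 0.3, 0.4)
direct = multinomial_pmf(x, pi)
conditional = conditional_poisson_pmf(x, pi)
print(direct, conditional)
```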

The same formal assertion implies that $X_i = (x_i - n\pi_i)/\sqrt{n\pi_i}$ 
has mean zero and unit variance. If further $n\pi_i$ is sufficiently large 
(e.g. > 9), the distribution of $X_i$ will be approximated by the normal 
(0, 1) distribution. 

The vector X that can be represented as a point in a k-dimensional 
space K, has thus approximately a probability density 

$$
C\, e^{-\frac{1}{2}\left(X_1^2 + X_2^2 + \cdots + X_k^2\right)}
$$

however, with the restriction 

$$
\sum_{i=1}^{k} x_i = n, \quad \text{or} \quad \sum_{i=1}^{k} X_i\sqrt{n\pi_i} + \sum_{i=1}^{k} n\pi_i = n, \quad \text{or} \quad \sum_{i=1}^{k} X_i\sqrt{\pi_i} = 0.
$$

In other words X is situated in the (k − 1) dimensional subspace of K, 
perpendicular to the vector $(\sqrt{\pi_1}, \sqrt{\pi_2}, \ldots, \sqrt{\pi_k})$. 
Thus we find the (known) result that 

$$
X^2 = \sum_{i=1}^{k} \frac{(x_i - n\pi_i)^2}{n\pi_i}
$$

has approximately a $\chi^2$-distribution with (k − 1) dimensions (or degrees 
of freedom). 

In Sec. 4 we succeeded in splitting up X (situated in a 3-dimensional 
space) in three perpendicular components each of which has a special 
meaning. From the foregoing it follows that, if $E(x_i) = n\pi_i$, the square 
of every component vector, which is equal to the square of the 
coefficient belonging to it in (9), has approximately a $\chi^2$-distribution 
with one dimension. 

In the light of our definitions in Sec. 3, we can even say that the 
square of the length of the first component has a one-dimensional 
$\chi^2$-distribution, if only row effect is absent irrespective of whether 
the other effects are present or not. The same is true for the second 
component if only column effect is absent, and the same for the third 
if there are no interaction and not too large simultaneous main effects. 
In other words, each of these components affords us a specific test 
criterion for the null hypothesis that there is no row effect, or no column 
effect, or no interaction, which can be tested independently of the validity 
of the other two hypotheses to a certain extent. The last restriction 
concerns the simultaneous occurrence of considerable row and column 
effects which influences the component for interaction. The numerical 
value of each of these three statistics is thus obtained by splitting up 
the known test criterion $X^2$ for goodness of fit in three terms, the first 
of which is equal to 

$$
\frac{\left[(x_1 + x_2)q_1 - (x_3 + x_4)p_1\right]^2}{np_1q_1},
$$

the second equal to 

$$
\frac{\left[(x_1 + x_3)q_2 - (x_2 + x_4)p_2\right]^2}{np_2q_2},
$$

and the third is the rest: 

$$
\frac{\left[x_1q_1q_2 - x_2q_1p_2 - x_3p_1q_2 + x_4p_1p_2\right]^2}{np_1q_1p_2q_2}.
$$

Just as is the case in tests with $X^2$, the critical region in the 
one-dimensional $\chi^2$-distributions will be a one-sided upper critical region. 


Example (Fisher [3]). Counting of seedlings of self-fertilized maize, 
which was heterozygous Aa/bB (i.e. in repulsion phase) with respect 
to two properties, viz. starchy versus sugary and green versus white, 
gave the following results: 


starchy sugary 


green 1997 904 
white 906 32 


If the properties are independent, then the vector level will be equal to 

$$
\begin{bmatrix} \tfrac{9}{16} & \tfrac{3}{16} \\ \tfrac{3}{16} & \tfrac{1}{16} \end{bmatrix} \qquad \text{with } p_1 = p_2 = \tfrac{3}{4}.
$$

The ordinary $\chi^2$-test criterion for these probabilities will be found to 
be equal to 12.21 + 47.13 + 48.16 + 180.19 = 287.69 which is significant, 
e.g. at the 1% level. 

In order to investigate the origin of this discrepancy from our 
expectation (compare case (3') in Sec. 2), we calculate the squares of 
the coefficients in (9): corresponding with row effect 

$$
\frac{\left[(x_1 + x_2)q_1 - (x_3 + x_4)p_1\right]^2}{np_1q_1} = \frac{\left[x_1 + x_2 - 3(x_3 + x_4)\right]^2}{3n} = 0.65,
$$

with column effect 

$$
\frac{\left[x_1 + x_3 - 3(x_2 + x_4)\right]^2}{3n} = 0.78,
$$

with interaction 

$$
\frac{(x_1 - 3x_2 - 3x_3 + 9x_4)^2}{9n} = 286.27,
$$

which are the same as those obtained by Fisher. The sum of these 
squares is indeed equal to the value of $X^2$. Further, we see that the 
deviation from our expectation is practically exclusively due to 
interaction or linkage. 
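
The figures of this example can be recomputed from the four counts. The sketch below uses the maize counts given above with $p_1 = p_2 = \tfrac34$; the variable names are illustrative.

```python
# Counts from the maize example: green/starchy, green/sugary,
# white/starchy, white/sugary
x1, x2, x3, x4 = 1997, 904, 906, 32
n = x1 + x2 + x3 + x4
p1 = p2 = 0.75
q1, q2 = 1 - p1, 1 - p2

# Goodness-of-fit criterion against the 9:3:3:1 level
expected = [n * p1 * p2, n * p1 * q2, n * q1 * p2, n * q1 * q2]
terms = [(o - e) ** 2 / e for o, e in zip((x1, x2, x3, x4), expected)]
X2 = sum(terms)

# The three one-dimensional components
row = ((x1 + x2) * q1 - (x3 + x4) * p1) ** 2 / (n * p1 * q1)
col = ((x1 + x3) * q2 - (x2 + x4) * p2) ** 2 / (n * p2 * q2)
inter = (x1 * q1 * q2 - x2 * q1 * p2 - x3 * p1 * q2
         + x4 * p1 * p2) ** 2 / (n * p1 * q1 * p2 * q2)

print([round(t, 2) for t in terms], round(X2, 2))
print(round(row, 2), round(col, 2), round(inter, 2))
```

The recomputed values agree with the printed ones up to rounding in the last digit, and the three components add up to the goodness-of-fit criterion exactly.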


In many cases we have no theoretical indications about the chances 
for rows and columns. Then (and in the case that row and column 
effects appear to be present and a further investigation of dependence 
is wished), the unknown chances are estimated according to the 
maximum likelihood method assuming independence between row and 
column chances. These estimates are proportional to the marginal 
totals and imply that $X^2$ has a one-dimensional $\chi^2$-distribution. 

A more elementary way of approaching this problem and its 
consequence for $X^2$ and for the partition of $X^2$ runs as follows. Consider the 
set of all possible values of x/n in a 2 × 2 table with the independent 
chances $p_1$ and $p_2$ unknown and with fixed n. In the subset where the 
marginal totals are fixed, the conditional probability distribution of 
x/n will be: 

$$
\frac{\dfrac{n!}{x_1!\,x_2!\,x_3!\,x_4!}\,(p_1p_2)^{x_1}(p_1q_2)^{x_2}(q_1p_2)^{x_3}(q_1q_2)^{x_4}}
{\dfrac{n!}{(x_1 + x_2)!\,(x_3 + x_4)!}\,p_1^{x_1 + x_2}q_1^{x_3 + x_4} \cdot \dfrac{n!}{(x_1 + x_3)!\,(x_2 + x_4)!}\,p_2^{x_1 + x_3}q_2^{x_2 + x_4}}
$$

which appears to be independent of the unknown $p_i$. We may 
therefore assume any $p_i$ to be true in the multinomial distribution from 
which the conditional distribution in the chosen subset can be obtained. 
It is thus permissible to assume that in the original multinomial 
distribution the $p_i$ are equal to the chosen marginal totals. We saw that 
the unconditional distribution of X with the assumed $p_i$ as parameters 
could be approximated by the three-dimensional normal distribution. 
By imposing the conditions of the considered subset to this X, i.e. 
that the marginal totals of x/n should be equal to the assumed row 
and column probabilities, X will be limited to a one-dimensional space. 
In the subset the conditional distribution of such a $X^2$ will be that of 
a one-dimensional $\chi^2$. Because this conditional distribution does not 
depend on the marginal totals defining the subset, it is valid in general. 
Further, it follows that by assuming the $p_i$ equal to the corresponding 
observed frequencies, the two components for row and column effects 
in the partition (9) of such a $X^2$ will vanish, and that by this very 
choice the square of the component for interaction has a one-dimensional 
$\chi^2$-distribution. 

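In this particular case the component for interaction can be reduced 
by substituting the marginal totals for $p_i$ and $q_i$ to 

$$
\frac{n(x_1x_4 - x_2x_3)^2}{(x_1 + x_2)(x_3 + x_4)(x_1 + x_3)(x_2 + x_4)}
$$

or using: $x_1x_4 - x_2x_3 = x_1(x_1 + x_2 + x_3 + x_4) - (x_1 + x_2)(x_1 + x_3)$ to 

$$
\frac{\left[x_1 - \dfrac{(x_1 + x_2)(x_1 + x_3)}{n}\right]^2}
{n\left(\dfrac{x_1 + x_2}{n}\right)\left(\dfrac{x_3 + x_4}{n}\right)\left(\dfrac{x_1 + x_3}{n}\right)\left(\dfrac{x_2 + x_4}{n}\right)}
$$

which will be useful later on. 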
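
Both reduced forms can be evaluated and compared on the maize counts of the earlier example. This is a small sketch; the function name `interaction_chi2_from_margins` is hypothetical.

```python
def interaction_chi2_from_margins(x1, x2, x3, x4):
    """Interaction component when the p_i are taken from the
    marginal totals, computed in both reduced forms."""
    n = x1 + x2 + x3 + x4
    r1, r2 = x1 + x2, x3 + x4      # row totals
    c1, c2 = x1 + x3, x2 + x4      # column totals
    first = n * (x1 * x4 - x2 * x3) ** 2 / (r1 * r2 * c1 * c2)
    second = (x1 - r1 * c1 / n) ** 2 / (
        n * (r1 / n) * (r2 / n) * (c1 / n) * (c2 / n))
    return first, second


first, second = interaction_chi2_from_margins(1997, 904, 906, 32)
print(round(first, 2), round(second, 2))
```

The two forms are algebraically identical, so any difference in the printed values would only reflect floating-point rounding.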


Example: If we take the same numbers as in the previous example 
and we suppose nothing to be known about probabilities, the estimates 
of expected numbers are 

$$
\begin{bmatrix} 2193.70 & 707.30 \\ 709.30 & 228.69 \end{bmatrix}
$$

and $X^2$ = 17.64 + 54.70 + 54.55 + 169.17 = 296.06, which is much 
more than the 0.1% point of the one-dimensional $\chi^2$-distribution 
(10.827). So there is interaction or lack of independence. The fact 
that the latter $X^2$ turns out to be larger than the former should not 
surprise us, as maximum likelihood estimators do not in general lead 
to a minimal value of $\chi^2$. Therefore it is possible that in this example 
the values $p_1 = p_2 = \tfrac{3}{4}$ yield a lower value of the goodness of fit criterion 
$X^2$ than the values of $p_1$ and $p_2$, estimated by the maximum likelihood 
method, do. 


6. $2^n$ tables 


It is not difficult to generalize the treatment of 2 × 2 tables to that 
of $2^n$ tables. We indicate the generalization by presenting the case 
n = 3. In that case the basis vector for the space of levels is defined 
to be a three-way table with given probabilities for the completely 
independent rows, columns, and layers ($p_i + q_i = 1$). The probabilities 
for each of the eight cells are then given by Fig. 1. The probability 
for row 1 (back face) is $p_1$, for row 2 (front face) $q_1$, for column 1 (left 
side) $p_2$, for column 2 (right side) $q_2$, for layer 1 (bottom) $p_3$, and 
for layer 2 (upper face) $q_3$. 

We call a vector row effect if the sum of this vector and the vector 
level looks like Fig. 2. The probability for back face is $p_1 + \alpha$, for 
front face ($q_1 - \alpha$), for left side $p_2$, for right side $q_2$, for bottom $p_3$, 
for upper face $q_3$, all these being independent. 

We choose as a basis vector for row effects, column effects, and layer 
effects the vectors represented by Figs. 3, 4, and 5 respectively. 

We call a vector row × column interaction, if the sum of this vector 
and the vector level looks like Fig. 6. The probabilities in the six faces 
are equal to those in the vector level by itself, so that there is no main 
effect in this sum. The chances in the vertical edges are: 

$$
\begin{bmatrix} p_1p_2 + \delta & p_1q_2 - \delta \\ q_1p_2 - \delta & q_1q_2 + \delta \end{bmatrix}
$$

so that independence between rows and columns is disturbed. Further, 
the probabilities for layers are independent of those for the other 
classifications. 

We choose as a basis vector for row × column interaction, row × 
layer interaction, and column × layer interaction the vectors represented 
by Figs. 7, 8, and 9 respectively. 

Finally, we call a vector second-order interaction if the sum of this 
vector and the vector level looks like Fig. 10. 

The probabilities in the six faces are equal to those in the vector 
level by itself, so that main effect is absent in this sum. The probabilities 
in the twelve edges are also equal to those in the vector level by itself, 
so that there is no first-order interaction in the sum. 

But the disturbance of independence between row and column 
probabilities in the bottom is different from that in the upper face. 
Similar remarks can be made about row and layer probabilities and 
about column and layer probabilities. In other words, the relation 
between any pair of classifications cannot be described without 
involving the third classification. 

[Figs. 1 to 12, cube diagrams, not reproduced. Legible captions: 
Fig. 2, Level + row effect; Fig. 3, Basis for row effects; Fig. 4, Basis for 
column effects; Fig. 5, Basis for layer effects; Fig. 6, Level + row × 
column interaction; Fig. 7, Basis for row × column interaction; Fig. 8, 
Basis for row × layer interaction; Fig. 9, Basis for column × layer 
interaction; Fig. 10, Level + second order interaction; Fig. 11, Basis for 
second order interaction; Fig. 12, Example of a $2^3$ table (Roberts e.a.).] 


We choose as a basis vector for second-order interaction the vector 
represented by Fig. 11. 

About the thus defined interactions, remarks similar to that in 
Sec. 3 may be made. If main effects and interactions are present, we 
have in the place of $p_1p_2p_3$ a chance equal to 

$$
\begin{aligned}
&(p_1 + \alpha)(p_2 + \beta)(p_3 + \gamma) + \delta(p_3 + \gamma) + \varepsilon(p_2 + \beta) + \zeta(p_1 + \alpha) + \eta \\
&\quad = p_1p_2p_3 + \alpha p_2p_3 + \beta p_1p_3 + \gamma p_1p_2 + (\alpha\beta + \delta)p_3 + (\alpha\gamma + \varepsilon)p_2 \\
&\qquad + (\beta\gamma + \zeta)p_1 + (\alpha\zeta + \beta\varepsilon + \gamma\delta + \alpha\beta\gamma + \eta).
\end{aligned}
$$

From this it will be seen that three main effects together introduce 
contributions to all of the four interactions. In that case certain 
first-order interactions may nullify the additional contribution to the 
second-order interaction. If there are two main effects only, there 
will be a contribution to the first-order interaction between them. If 
in this case one or both of the other first-order interactions are 
non-zero, a contribution to the second-order interaction may take place. 
If there is only one main effect and simultaneously an interaction between 
the two other classifications, there will be a contribution to the 
second-order interaction. If there are no main effects at all, the second-order 
interaction obtains no contributions from possible first-order 
interactions. We may conclude that disturbing contributions to 
interactions may be caused by main effects, but that they will be negligible 
if $\alpha$, $\beta$, and $\gamma$ are small. The drawback of such contributions is not 
great as it is not our intention to estimate the different effects, but to 
investigate the origin of discrepancies between expectation and 
experimental result. 

It will be our purpose to split up a vector of experimental numbers 
$x_i/n$ ($i = 1, 2, \ldots, 8$; $\sum_i x_i = n$) into eight components in the 
directions of the defined basis vectors. To facilitate this partition, we again 
divide the numbers on similar places by the square root of the probability 
of that place in the vector level. Putting those probabilities equal to 
$\pi_i$, we obtain the following formulation of our problem: 

[Fig. 13. Equation (11): Partition of $2^3$ table. Cube diagram not reproduced.] 

Calculate the coefficients $\mu, \alpha, \beta, \ldots, \eta$ in the equation represented 
by Fig. 13 which will be called equation (11). All the "cubes" on the 
right-hand side of this equation are orthogonal unit vectors. 

(Observe that $\dfrac{\pi_1}{\pi_2} = \dfrac{\pi_3}{\pi_4} = \dfrac{\pi_5}{\pi_6} = \dfrac{\pi_7}{\pi_8}$ and $\dfrac{\pi_1}{\pi_3} = \dfrac{\pi_2}{\pi_4} = \dfrac{\pi_5}{\pi_7} = \dfrac{\pi_6}{\pi_8}$.) 

The required coefficients are thus again the inner products of the 
vector on the left side to be projected and the vector on which it is 
projected: 

$$
\begin{aligned}
\mu &= 1, \\
\alpha &= \frac{(x_1 + x_2 + x_5 + x_6)q_1 - (x_3 + x_4 + x_7 + x_8)p_1}{n}, \\
\beta &= \frac{(x_1 + x_3 + x_5 + x_7)q_2 - (x_2 + x_4 + x_6 + x_8)p_2}{n}, \\
\gamma &= \frac{(x_1 + x_2 + x_3 + x_4)q_3 - (x_5 + x_6 + x_7 + x_8)p_3}{n}, \\
\delta &= \frac{(x_1 + x_5)q_1q_2 - (x_2 + x_6)q_1p_2 - (x_3 + x_7)p_1q_2 + (x_4 + x_8)p_1p_2}{n}, \\
\varepsilon &= \frac{(x_1 + x_2)q_1q_3 - (x_3 + x_4)p_1q_3 - (x_5 + x_6)q_1p_3 + (x_7 + x_8)p_1p_3}{n}, \\
\zeta &= \frac{(x_1 + x_3)q_2q_3 - (x_2 + x_4)p_2q_3 - (x_5 + x_7)q_2p_3 + (x_6 + x_8)p_2p_3}{n}, \\
\eta &= \frac{x_1q_1q_2q_3 - x_2q_1p_2q_3 - x_3p_1q_2q_3 + x_4p_1p_2q_3}{n}
      - \frac{x_5q_1q_2p_3 - x_6q_1p_2p_3 - x_7p_1q_2p_3 + x_8p_1p_2p_3}{n}.
\end{aligned}
$$



Substituting these values in equation (11), Fig. 13, subtracting the first 
term on the right from both sides and multiplying on both sides by $\sqrt{n}$, 
we get on the left a vector X consisting of numbers $(x_i - n\pi_i)/\sqrt{n\pi_i}$ 
the square of which is again equal to the well-known goodness of fit 
criterion. $X^2$ has approximately a $\chi^2$-distribution with 7 degrees of 
freedom, if the null hypothesis that the vector level contains the 
probabilities for each cell is true. Further, the seven components of X on 
the right are perpendicular, and $X^2$ is equal to the sum of the seven 
squares of these components. Each of these squares has under the 
null hypothesis a $\chi^2$-distribution with one degree of freedom. 
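
The seven perpendicular components can be generated mechanically from the three pairs $(p_i, q_i)$. The sketch below is one way to do so, assuming the cell order implied by the coefficient formulas (cells 1 to 4 in layer 1, cells 5 to 8 in layer 2, each layer read row by row). The function name and the counts in the usage lines are hypothetical and are not the data of Fig. 12.

```python
import itertools
import math


def partition_2x2x2(x, p):
    """Seven orthogonal components of a 2 x 2 x 2 table.

    x : eight counts x1..x8 (layer, then row, then column)
    p : (p1, p2, p3), level probabilities for rows, columns, layers
    """
    n = sum(x)
    cells = list(itertools.product((0, 1), repeat=3))  # (layer, row, col)

    def prob(level, pf):
        return pf if level == 0 else 1 - pf

    pi = [prob(r, p[0]) * prob(c, p[1]) * prob(l, p[2]) for l, r, c in cells]
    X = [(xi - n * v) / math.sqrt(n * v) for xi, v in zip(x, pi)]

    names = ("row", "column", "layer")
    components = {}
    for size in (1, 2, 3):
        for subset in itertools.combinations(range(3), size):
            unit = []
            for l, r, c in cells:
                levels = (r, c, l)
                e = 1.0
                for f in range(3):
                    pf, qf = p[f], 1 - p[f]
                    if f in subset:   # factor takes part in the effect
                        e *= math.sqrt(qf) if levels[f] == 0 else -math.sqrt(pf)
                    else:             # factor stays at its level value
                        e *= math.sqrt(pf) if levels[f] == 0 else math.sqrt(qf)
                unit.append(e)
            label = " x ".join(names[f] for f in subset)
            components[label] = sum(a * b for a, b in zip(X, unit))
    return X, components


# Hypothetical counts, for illustration only
counts = [30, 25, 28, 22, 27, 31, 19, 26]
X, components = partition_2x2x2(counts, (0.5, 0.5, 0.5))
for label, value in components.items():
    print(f"{label:22s} {value ** 2:8.3f}")
print("total", round(sum(v * v for v in X), 3))
```

With all $p_i = \tfrac12$ each squared component reduces to a signed sum of the eight counts, squared and divided by n, which gives a quick hand check.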

Now again every component can serve as a statistic for testing a 
specific hypothesis. With the first three components we test $\alpha = 0$, 
$\beta = 0$, and $\gamma = 0$ respectively, with the following three components 
the hypotheses $\delta = 0$, $\varepsilon = 0$, and $\zeta = 0$, under the condition that $\alpha\beta$, 
$\alpha\gamma$, and $\beta\gamma$ are negligible respectively, and with the last component the 
hypothesis $\eta = 0$, under the condition that $\alpha\zeta + \beta\varepsilon + \gamma\delta + \alpha\beta\gamma$ is 
negligible. Each of these hypotheses is tested on the assumption of 
independence of the three classifications. 

The mentioned restrictions do not trouble us if no theoretical chances 
are available. Generalizing the considerations at the end of Sec. 5, 
we take the $p_i$ equal to marginal totals with the consequence that 
$\alpha = \beta = \gamma = 0$. If any product $\alpha\beta$, $\alpha\gamma$, or $\beta\gamma$ appears to be not negligible 
and a further investigation of interactions is required, it is recommended 
to take the relative $p_i$ from marginal totals. 

A more difficult situation will arise if a first-order interaction appears 
to be considerable or if it is expected in advance that any pair of 
classifications is not independent. This situation is obviously contrary to 
the assumption of independence of the three classifications and will be 
considered in Sec. 8. 


Example: (Roberts, Dawson, and Madden [14].) Crossing mice AaBbCc 
with aabbcc gave numbers represented in Fig. 12, (A in back face, 
B in left side, and C in bottom). In the level $p_i$ was $\tfrac12$ (i = 1, 2, 3). 
The value of $\chi^2$ for row × layer interaction, e.g. is 

$$
\frac{\left[(x_1 + x_2)q_1q_3 - (x_3 + x_4)p_1q_3 - (x_5 + x_6)q_1p_3 + (x_7 + x_8)p_1p_3\right]^2}{np_1q_1p_3q_3}
= \frac{(x_1 + x_2 - x_3 - x_4 - x_5 - x_6 + x_7 + x_8)^2}{n}
$$

and for second-order interaction is 

$$
\frac{n\eta^2}{p_1q_1p_2q_2p_3q_3}
= \frac{(x_1 - x_2 - x_3 + x_4 - x_5 + x_6 + x_7 - x_8)^2}{n}.
$$

The seven values for $\chi^2$ are: 

    row effect                     1.63 
    column effect                  0.67 
    layer effect                   1.03 
    row × column interaction       0.13 
    row × layer interaction        4.25 
    column × layer interaction     0.13 
    second-order interaction       2.79 
    total                         10.63 

The one-dimensional $\chi^2$ has at the 5% level of significance the 
critical value: 3.84; the 7-dimensional $\chi^2$: 14.07. The total $\chi^2$ is not 
significant. Concluding that the row × layer interaction is significant, 
would be rash: the probability that at least one of seven one-dimensional 
$\chi^2$ is larger than 4.25 is about 0.24. So there is only a slight indication 
that linkage between A and C may exist. 
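
The probability quoted here can be retraced with the standard library. The sketch assumes, as the argument above implicitly does, that the seven one-dimensional components behave as independent $\chi^2$ variables under the null hypothesis; the function names are hypothetical.

```python
import math


def chi2_1_upper_tail(c):
    """P(chi-square with 1 d.f. > c) = P(|Z| > sqrt(c)) for standard
    normal Z, which equals erfc(sqrt(c / 2))."""
    return math.erfc(math.sqrt(c / 2))


def prob_at_least_one_exceeds(c, m):
    """Chance that the largest of m independent one-dimensional
    chi-square values exceeds c (independence is an assumption)."""
    return 1 - (1 - chi2_1_upper_tail(c)) ** m


print(round(chi2_1_upper_tail(3.84), 3))
print(round(prob_at_least_one_exceeds(4.25, 7), 2))
```

The same function with m = 4 covers a table in which only the four interaction components are tested.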

It is not always possible to attach a simple meaning to a 
second-order interaction, but in cases like this it could be caused by the fact 
that one of the eight gene combinations, as a consequence of diminished 
viability, is much less frequent than is expected on account of main 
effects and first-order interactions only. 

About the calculations we can remark the following: $\chi^2$ for row × 
column interaction can be determined as the interaction $\chi^2$ in the two 
by two row-column table which can be formed by adding along the four 
vertical edges; the formula for $\delta$ is then the same as that for $\gamma$ in a two 
by two table. Thus one can calculate this $\chi^2$ again from the test criterion 
$X^2$ for this two by two table and by subtraction of the main effects 
for rows and columns. The second-order interaction $\chi^2$ is found as 
the difference of the test criterion $X^2$ for the $2^3$ table and the sum of 
those for main effects and first-order interactions. 

If the probabilities $p_i$ are not known, we take them such that the 
main effects equal zero. In our example we then obtain the values of $\chi^2$ for 

    row × column interaction       0.12 
    row × layer interaction        4.34 
    column × layer interaction     0.14 
    second-order interaction       2.69 
    total                          7.29 

The 4-dimensional $\chi^2$ has at the 5% level of significance a critical 
value: 9.49. The total $\chi^2$ is not significant. The probability that at 
least one of four one-dimensional $\chi^2$ is larger than 4.34 is about 0.14; 
so there is a slight indication that linkage between A and C may exist. 

An example of the partition of a $2^5$ table in 31 components is given 
by Haldane [5]. 


7. m × n × ··· tables 


In some particular cases a partition of m × n × ··· tables (and 
even of non-orthogonal tables) may have sense. We indicate the case 
of a 2 × 3 table which can be generalized easily. 

An inquiry into the attitude with respect to a political proposal 
may be summarized in a table of experimental counts: 


for against no opinion 

In this case an appropriate definition of a vector level will be: 

$$
\begin{bmatrix} p_1p_2p_3 & p_1p_2q_3 & p_1q_2 \\ q_1p_2p_3 & q_1p_2q_3 & q_1q_2 \end{bmatrix}
$$

The chances for rows (men and women) are $p_1$ and $q_1$ respectively, 
those for columns 1 and 2 together (politically interested), and for 
column 3 (politically not interested), $p_2$ and $q_2$ respectively, and those 
of columns 1 and 2, $p_2p_3$ and $p_2q_3$ (i.e. $p_3$ and $q_3$ under the condition of 
being in column 1 or 2). Moreover, all these probabilities are 
independent. Without further explanation we define as basis vector for 
row effect: 

$$
\begin{bmatrix} p_2p_3 & p_2q_3 & q_2 \\ -p_2p_3 & -p_2q_3 & -q_2 \end{bmatrix}
$$

Column effects will have two basis vectors, one corresponding with 
modifications of $p_2$ and $q_2$, independence being maintained: 

$$
\begin{bmatrix} p_1p_3 & p_1q_3 & -p_1 \\ q_1p_3 & q_1q_3 & -q_1 \end{bmatrix}
\tag{12}
$$

and one corresponding with similar modifications of $p_3$ and $q_3$: 

$$
\begin{bmatrix} p_1p_2 & -p_1p_2 & 0 \\ q_1p_2 & -q_1p_2 & 0 \end{bmatrix}
\quad \text{or rather} \quad
\begin{bmatrix} p_1 & -p_1 & 0 \\ q_1 & -q_1 & 0 \end{bmatrix}.
\tag{13}
$$

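If two such column effects occur together, viz. $\beta$ times (12) and $\gamma$ times
(13), and if they are additive, we obtain:
\[
\begin{bmatrix}
p_1(p_2+\beta)\left(p_3+\dfrac{\gamma}{p_2}\right) - \dfrac{\beta\gamma}{p_2}\,p_1 &
p_1(p_2+\beta)\left(q_3-\dfrac{\gamma}{p_2}\right) + \dfrac{\beta\gamma}{p_2}\,p_1 &
p_1(q_2-\beta) \\[3mm]
q_1(p_2+\beta)\left(p_3+\dfrac{\gamma}{p_2}\right) - \dfrac{\beta\gamma}{p_2}\,q_1 &
q_1(p_2+\beta)\left(q_3-\dfrac{\gamma}{p_2}\right) + \dfrac{\beta\gamma}{p_2}\,q_1 &
q_1(q_2-\beta)
\end{bmatrix}
\]
where $p_i + q_i = 1$ $(i = 1, 2, 3)$. This is:
\[
\frac{\beta\gamma}{p_2}
\begin{bmatrix}
p_1 & -p_1 & 0 \\
q_1 & -q_1 & 0
\end{bmatrix}
\]
less than a level with probabilities $p_1$, $p_2 + \beta$, $p_3 + \gamma/p_2$, etc. With
respect to this level there is a deficit of $\beta\gamma/p_2$ times the column effect
(13). As $\beta$ will be small in comparison with $p_2$, $\beta\gamma/p_2$, however, will
be negligible in comparison with $\gamma$, and even will be zero, if $\beta$ or $\gamma$ is
zero. Similar remarks can be made about simultaneous occurrence of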
additive row and column effects; there will be disturbances of
independence which will be negligible in comparison with proper interactions.
We choose as basis vectors for interactions:
\[
\begin{bmatrix}
p_3 & q_3 & -1 \\
-p_3 & -q_3 & 1
\end{bmatrix}
\]
which represents the disturbance of independence between row
probabilities, $p_1$ and $q_1$, and column probabilities, $p_2$ and $q_2$, and
\[
\begin{bmatrix}
1 & -1 & 0 \\
-1 & 1 & 0
\end{bmatrix}
\]
which represents a similar disturbance of independence between row
probabilities and column probabilities, $p_3$ and $q_3$. It will be our purpose
again to split up a vector of experimental numbers $x_i/n$ ($i = 1, 2, \ldots, 6$;
$\sum x_i = n$) in six components in the directions of the defined basis
vectors. We divide again the numbers on similar places by the square
root of the probability of that place in the vector level. Putting those
probabilities equal to $\pi_i$, we obtain:


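\begin{align}
\begin{bmatrix}
\dfrac{x_1}{n\sqrt{\pi_1}} & \dfrac{x_2}{n\sqrt{\pi_2}} & \dfrac{x_3}{n\sqrt{\pi_3}} \\[3mm]
\dfrac{x_4}{n\sqrt{\pi_4}} & \dfrac{x_5}{n\sqrt{\pi_5}} & \dfrac{x_6}{n\sqrt{\pi_6}}
\end{bmatrix}
&= c_0
\begin{bmatrix}
\sqrt{\pi_1} & \sqrt{\pi_2} & \sqrt{\pi_3} \\
\sqrt{\pi_4} & \sqrt{\pi_5} & \sqrt{\pi_6}
\end{bmatrix}
+ c_1
\begin{bmatrix}
\sqrt{q_1p_2p_3} & \sqrt{q_1p_2q_3} & \sqrt{q_1q_2} \\
-\sqrt{p_1p_2p_3} & -\sqrt{p_1p_2q_3} & -\sqrt{p_1q_2}
\end{bmatrix} \notag \\
&\quad + c_2
\begin{bmatrix}
\sqrt{p_1q_2p_3} & \sqrt{p_1q_2q_3} & -\sqrt{p_1p_2} \\
\sqrt{q_1q_2p_3} & \sqrt{q_1q_2q_3} & -\sqrt{q_1p_2}
\end{bmatrix}
+ c_3
\begin{bmatrix}
\sqrt{p_1q_3} & -\sqrt{p_1p_3} & 0 \\
\sqrt{q_1q_3} & -\sqrt{q_1p_3} & 0
\end{bmatrix} \notag \\
&\quad + c_4
\begin{bmatrix}
\sqrt{q_1q_2p_3} & \sqrt{q_1q_2q_3} & -\sqrt{q_1p_2} \\
-\sqrt{p_1q_2p_3} & -\sqrt{p_1q_2q_3} & \sqrt{p_1p_2}
\end{bmatrix}
+ c_5
\begin{bmatrix}
\sqrt{q_1q_3} & -\sqrt{q_1p_3} & 0 \\
-\sqrt{p_1q_3} & \sqrt{p_1p_3} & 0
\end{bmatrix}
\tag{14}
\end{align}

The six new vectors (in brackets) on the right are orthogonal unit
vectors. The coefficients are obtained again by taking inner products:
\begin{align*}
c_0 &= 1, \\
c_1 &= \frac{1}{n\sqrt{p_1q_1}}\,[(x_1 + x_2 + x_3)q_1 - (x_4 + x_5 + x_6)p_1], \\
c_2 &= \frac{1}{n\sqrt{p_2q_2}}\,[(x_1 + x_2 + x_4 + x_5)q_2 - (x_3 + x_6)p_2], \\
c_3 &= \frac{1}{n\sqrt{p_2p_3q_3}}\,[(x_1 + x_4)q_3 - (x_2 + x_5)p_3], \\
c_4 &= \frac{1}{n\sqrt{p_1q_1p_2q_2}}\,[(x_1 + x_2)q_1q_2 - x_3q_1p_2 - (x_4 + x_5)p_1q_2 + x_6p_1p_2], \\
c_5 &= \frac{1}{n\sqrt{p_1q_1p_2p_3q_3}}\,[x_1q_1q_3 - x_2q_1p_3 - x_4p_1q_3 + x_5p_1p_3].
\end{align*}
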


Substituting these values in equation (14), subtracting the first term
on the right from both sides and multiplying on both sides by $\sqrt{n}$, we
get on the left a vector X consisting of numbers $(x_i - n\pi_i)/\sqrt{n\pi_i}$, the
square of which is again equal to the goodness of fit criterion. Under
the null hypothesis that the vector level is true, the square of each
term on the right has a $\chi^2$-distribution with one degree of freedom.
It will be remarked that the $\chi^2$ for row effect, the first column effect,
and the first interaction can be obtained as the $\chi^2$ for the similar
components in a 2 X 2 table which is deduced from our table by
amalgamating the columns 1 and 2. The $\chi^2$ for the second column effect
and the second interaction are equal to
\[
\frac{[(x_1 + x_4)q_3 - (x_2 + x_5)p_3]^2}{p_3q_3 \cdot np_2}
\quad\text{and}\quad
\frac{(x_1q_1q_3 - x_2q_1p_3 - x_4p_1q_3 + x_5p_1p_3)^2}{p_1q_1p_3q_3 \cdot np_2}
\]
respectively. They will be obtained as the $\chi^2$ for column effect and
interaction in the 2 X 2 table consisting of the first and second column
[see equation (9)], with the restriction, however, that we do not use the
experimental total, $x_1 + x_2 + x_4 + x_5$, in the denominator, but the
expected total $np_2$.
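
The orthogonality claimed for the six vectors, and the resulting additivity of the components, can be checked numerically. The sketch below is not from the original paper: it builds each basis vector for the 2 X 3 level (the level itself, the row effect, the two column effects and the two interactions), divides it by the square roots of the level probabilities, and normalizes it. The probabilities 0.5, 0.8 and 0.7 are hypothetical theoretical values chosen only for illustration; the counts are those of the example given later in this section.

```python
import math

def orthonormal_basis(p1, p2, p3):
    """Level probabilities and the six unit vectors for the 2 X 3 table."""
    q1, q2, q3 = 1 - p1, 1 - p2, 1 - p3
    level = [p1*p2*p3, p1*p2*q3, p1*q2,
             q1*p2*p3, q1*p2*q3, q1*q2]
    raw = [
        level,                                       # the level itself
        [p2*p3, p2*q3, q2, -p2*p3, -p2*q3, -q2],     # row effect
        [p1*p3, p1*q3, -p1, q1*p3, q1*q3, -q1],      # column effect (12)
        [p1, -p1, 0.0, q1, -q1, 0.0],                # column effect (13)
        [p3, q3, -1.0, -p3, -q3, 1.0],               # first interaction
        [1.0, -1.0, 0.0, -1.0, 1.0, 0.0],            # second interaction
    ]
    units = []
    for vec in raw:
        scaled = [v / math.sqrt(pi) for v, pi in zip(vec, level)]
        norm = math.sqrt(sum(c * c for c in scaled))
        units.append([c / norm for c in scaled])
    return level, units

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

counts = [1154, 475, 243, 1083, 442, 362]
n = sum(counts)
p1, p2, p3 = 0.5, 0.8, 0.7          # hypothetical theoretical probabilities
level, units = orthonormal_basis(p1, p2, p3)

# Left-hand vector after removing the level term and multiplying by sqrt(n)
z = [(x - n * pi) / math.sqrt(n * pi) for x, pi in zip(counts, level)]
goodness_of_fit = dot(z, z)
components = [dot(z, u) ** 2 for u in units[1:]]   # five 1-df components
print(round(goodness_of_fit, 3), [round(c, 3) for c in components])
```

Under these assumptions the five squared components add up to the goodness of fit criterion, and the third of them agrees with the closed expression for the second column effect.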

If there are no theoretical values $p_i$, then we take them such that
main effects are absent, i.e. from marginal totals (these are also maximum
likelihood estimates). Then the $\chi^2$ for the first interaction will be
found as the ordinary $\chi^2$ test criterion for the 2 X 2 table obtained by
amalgamating the first two columns and the $\chi^2$ for the second
interaction as the $\chi^2$ for interaction in a 2 X 2 table consisting of the first
two columns, but with row and column probabilities estimated from
the marginal totals of the whole 2 X 3 table. In this case $np_2$ is of
course taken equal to $x_1 + x_2 + x_4 + x_5$.


Example: The inquiry mentioned in the beginning of this section may 
have the following result: 


for against no opinion 
men 1154 475 243 
women 1083 442 362 


(These numbers are taken from Introduction to the Theory of Statistics 
by A. M. Mood, page 273, where they occur as an example of a 2 X 3 
contingency table.) Theoretical probabilities being absent, we calculate
the two-dimensional $\chi^2$ for the whole table with the aid of expected
values obtained from marginal totals:


1114.04 456.67 301.29 
1122.96 460.33 303.71 


and we find 26.78. The critical value of a two-dimensional $\chi^2$ is 9.21
at the 1% level. So we conclude that there is association in our numbers.
In order to investigate the origin of this association, we calculate the
$\chi^2$ for the first interaction, which is found as the ordinary $\chi^2$ from the
amalgamated table:


opinion no opinion 


men 1629 243 
women 1525 362 


and with expected values also obtained by amalgamation of the
calculated expected values:


1570.71 301.29 
1583.29 303.71 


and we find 26.77. By subtraction it follows that $\chi^2$ for the second
interaction is equal to 0.01. We conclude therefore that there was a
difference in interest between the sexes, but that no difference in attitude
towards the proposal could be detected between men and women.
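
The arithmetic of this example can be reproduced with the short sketch below. It is an illustration rather than the authors' computation: the helper names are invented here, the expected values are not rounded to two decimals, and the second interaction is computed twice, once by subtraction and once from the closed expression with $np_2$ set to $x_1 + x_2 + x_4 + x_5$.

```python
def expected_from_margins(table):
    """Expected counts under independence, from row and column totals."""
    n = sum(map(sum, table))
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    return [[ri * cj / n for cj in cols] for ri in rows]

def pearson_chi2(observed, expected):
    return sum((o - e) ** 2 / e
               for orow, erow in zip(observed, expected)
               for o, e in zip(orow, erow))

def merge_first_two_columns(table):
    return [[row[0] + row[1], row[2]] for row in table]

observed = [[1154, 475, 243],    # men:   for, against, no opinion
            [1083, 442, 362]]    # women: for, against, no opinion
expected = expected_from_margins(observed)

total = pearson_chi2(observed, expected)
first = pearson_chi2(merge_first_two_columns(observed),
                     merge_first_two_columns(expected))
second_by_subtraction = total - first

# Closed expression for the second interaction, probabilities from margins
(x1, x2, x3), (x4, x5, x6) = observed
n = x1 + x2 + x3 + x4 + x5 + x6
p1 = (x1 + x2 + x3) / n
q1 = 1 - p1
np2 = x1 + x2 + x4 + x5
p3 = (x1 + x4) / np2
q3 = 1 - p3
second_direct = ((x1*q1*q3 - x2*q1*p3 - x4*p1*q3 + x5*p1*p3) ** 2
                 / (p1 * q1 * p3 * q3 * np2))

print(round(total, 2), round(first, 2), round(second_direct, 3))
```

With unrounded expected values the total comes out marginally above the 26.78 quoted in the text, which appears to have been computed from the two-decimal expected values.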

In general each one-dimensional $\chi^2$ in a contingency table, where
the probabilities for rows and columns are estimated from the data
and where a similar partition takes place, is connected with one of the
2 X 2 tables which are obtained by successive amalgamations of the
data. Such a $\chi^2$ is calculated as the ordinary $\chi^2$ for interaction (as
derived in Sec. 5) in this 2 X 2 table of which the table of expected
values is obtained by a similar amalgamation of the complete scheme of
expected values (estimated by the maximum likelihood method). The
last results where theoretical chances are unknown are the same as
those of Lancaster [10] and Kimball [8]. In connection with the
foregoing it may be remarked that the method, suggested by Lancaster
[11] as an exact one, for calculating $\chi^2$ in a contingency table where
cells with small expectations are pooled, does not seem to be correct.

When pooling of cells takes place it is not correct to say only that
one or more one-dimensional $\chi^2$ reduce to zero. If, e.g. in a 3 X 3 table,
two cells in the same row are pooled, in other words are conceived as
one cell, these cells do not contribute information about the estimation
of the probabilities for the columns to which these cells belonged before
pooling. For the vector level can be described as:
\[
\begin{array}{c|c|c}
p_1p_2p_3 & p_1p_2q_3p_4 & p_1p_2q_3q_4 \\
p_1q_2p_3 & p_1q_2q_3p_4 & p_1q_2q_3q_4 \\
q_1p_3 & \multicolumn{2}{c}{q_1q_3}
\end{array}
\]
with $p_i + q_i = 1$ $(i = 1, 2, 3, 4)$.

The estimation of $p_1$ and $p_2$ takes place from row totals in the same
way as in the 3 X 3 table. Similarly, $p_3$ will be estimated from the
total of the first column and the total of the second and the third
column together. But $p_4$ will be estimated from the proportion of the
totals in what has been left of the second and the third column, namely
in the first and the second row. In the expressions for several of the
one-dimensional chi-squares that do not vanish by pooling, the
(estimated) values of $p_4$ and $q_4$ will in general be different from the
values obtained by estimation in the complete 3 X 3 table.

The same conclusion follows from the fact that as a consequence of
such pooling in the complete partition of a 3 X 3 table according to
main effects and interactions, the basis of column effect with respect
to $p_4$ will coincide with that for interaction in the 2 X 2 table formed
by the second and third column on the one hand, and by the sum of
the first and second row and the third row on the other hand. Thus
not only this interaction but also the mentioned main effect should
vanish in the partition after pooling, in order that $X^2$, which measures
the discrepancy with respect to a level, as represented here, contain
three interaction components only.


8. Further remarks about $2^3$ tables


Several authors (Kendall [7], Simpson [15]) warn against amalgamating
$2^3$ tables to 2 X 2 tables, even when second-order interaction
happens to be absent. They demonstrate the possibility:

(a) that interactions, which in each of the amalgamated classes
separately tend in the same direction, seem to be absent after
amalgamation, or

(b) that the amalgamated classes separately do not show dependence
between the two other properties, but that they do together.

In our opinion this warning is exaggerated in many cases and danger
threatens from another direction. In our view, as has been shown in Sec. 6,
dependence between two classifications in a $2^3$ table will just be tested
in a 2 X 2 table obtained by amalgamation in the $2^3$ table, irrespective
of whether second-order interaction is present or not.

While second-order interaction is absent, the case (a) may be con- 
structed by adding a level, a row X layer interaction, and a column X 
layer interaction. In the bottom of the table this appears as adding 
of a level and suitable row and column effects in a 2 X 2 table so that 
a small interaction occurs in it (see Sec. 3). In a similar way a small 
interaction which will have the same direction appears in the upper 
face. However, in the 2 X 2 table obtained by amalgamation of bottom 
and upper face, interaction will be absent. 

Also, while second-order interaction is absent, the case (b) may be 
constructed in the following way. First we form the sum of a level
with $p_i = \tfrac{1}{2}$, a row X layer interaction, and a column X layer
interaction. Because the interactions in the bottom and in the upper face
of this sum are identical, we can add a row X column interaction to it 
such that both interactions in the bottom and in the upper face separately 
vanish. The 2 X 2 table obtained by amalgamation of bottom and 
upper face shows row X column interaction of course. These dis- 
turbing interactions, however, will be negligible in comparison with 
proper interactions due to dependence. 

The danger to which we alluded consists in maintaining the
hypothesis of independence in the model of a $2^3$ table and the corresponding
partition, although a first-order interaction appears to be considerable,
or independence between one or more classifications can be expected
to be impossible in advance. For the discussed $\chi^2$-test and the partition
in a $2^3$ table (and with this remark we proceed on what we said in
Sec. 6) are only justified if the three classifications are independent.

In a case, e.g. like the example given by Simpson [15] where the 
result of a treatment against a disease is investigated by counting 
dead and alive in males and females, it is not obvious that the prob- 
ability of being treated is the same for males and females. 

If one first-order interaction must be taken into account, we have to
choose a new model to test other interactions. The new model for
this case is that for a 2 X 4 table. Let the experimental result multiplied
by $n = \sum_{i=1}^{8} x_i$ be:


                 treated            not treated
              male   female       male   female
   alive      x_1     x_2         x_3     x_4
   dead       x_5     x_6         x_7     x_8

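After the choice of a level similar to that in the foregoing section:
\[
\begin{bmatrix}
p_1p_2p_3p_4 & p_1p_2q_3p_4 & p_1q_2p_4 & p_1q_4 \\
q_1p_2p_3p_4 & q_1p_2q_3p_4 & q_1q_2p_4 & q_1q_4
\end{bmatrix}
\]
the basis vectors for interactions may be:
\[
\begin{bmatrix}
1 & -1 & 0 & 0 \\
-1 & 1 & 0 & 0
\end{bmatrix},
\quad
\begin{bmatrix}
p_3 & q_3 & -1 & 0 \\
-p_3 & -q_3 & 1 & 0
\end{bmatrix}
\]
and
\[
\begin{bmatrix}
p_2p_3 & p_2q_3 & q_2 & -1 \\
-p_2p_3 & -p_2q_3 & -q_2 & 1
\end{bmatrix}.
\]
For convenience we put the $p_i$ as unknown for the rest of this section
so that no main effects are present.

With the mentioned example in mind we prefer to define the vector
level as:
\[
\begin{bmatrix}
p_1p_2p_4 & p_1q_2p_4 & q_1p_3p_4 & q_1q_3p_4 \\
p_1p_2q_4 & p_1q_2q_4 & q_1p_3q_4 & q_1q_3q_4
\end{bmatrix}
\]
which is the same as that for a $2^3$ table if $p_2 = p_3$. A basis for
interactions may be formed by:
\[
\begin{bmatrix}
p_2 & q_2 & -p_3 & -q_3 \\
-p_2 & -q_2 & p_3 & q_3
\end{bmatrix},
\quad
\begin{bmatrix}
1 & -1 & 0 & 0 \\
-1 & 1 & 0 & 0
\end{bmatrix}
\]
and
\[
\begin{bmatrix}
0 & 0 & 1 & -1 \\
0 & 0 & -1 & 1
\end{bmatrix}.
\tag{15}
\]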

The first of these three vectors represents dependence of death rate
on treatment, the second and the third represent interaction of sex
and death rate within the treated and within the not-treated individuals
respectively. If one prefers to consider dependence between sex and
death rate and further dependence of death rate on the treatment
within the sexes separately, one should only transpose the second and
third column in each of the four vectors. We proceed in the former
version.

The last pair of vectors can be replaced in several ways by two other
vectors, the one representing a common interaction between sex and
death rate in both treated and not-treated individuals, the other
representing a difference in dependence of death rate on sex between
treated and not-treated individuals, and, moreover, such that a
partitioning of $X^2$ in independent components corresponds to this choice.
We prefer:
\[
\begin{bmatrix}
p_1p_2q_2 & -p_1p_2q_2 & q_1p_3q_3 & -q_1p_3q_3 \\
-p_1p_2q_2 & p_1p_2q_2 & -q_1p_3q_3 & q_1p_3q_3
\end{bmatrix}
\]
and
\[
\begin{bmatrix}
1 & -1 & -1 & 1 \\
-1 & 1 & 1 & -1
\end{bmatrix}.
\]
The first reason for this preference is that the mentioned difference in
dependence, which may be called second-order interaction, is
represented by the same basis vector as in the $2^3$ table. A second reason will
be given in the treatment of the following case. A third reason will
appear later on. The component of $X^2$ corresponding to the first vector
of (15) can be calculated as the test criterion in the 2 X 2 table obtained
by neglecting sex. The computation of the other components will be
treated in the following.

It is also possible that two first-order interactions must be taken into
account. Let us assume also that an interaction between treatment
and death rate exists in the discussed 2 X 4 table. We take as a new
level:
\[
\begin{bmatrix}
p_1p_2p_4 & p_1q_2p_4 & q_1p_3p_5 & q_1q_3p_5 \\
p_1p_2q_4 & p_1q_2q_4 & q_1p_3q_5 & q_1q_3q_5
\end{bmatrix}
\]
which is the same as that for a 2 X 4 table if $p_4 = p_5$ and as that for a
$2^3$ table if, moreover, $p_2 = p_3$. This level corresponds to that for two
separate and independent 2 X 2 tables. A basis for interactions may
be formed by the last two vectors of the set given by (15).

This pair can be replaced again in several ways by another pair of
vectors which express a common interaction between sex and death
rate in both 2 X 2 tables, and a difference between such interactions
respectively, and which admit a partition of $X^2$ in independent
components. We choose:
\[
\begin{bmatrix}
p_1p_2q_2p_4q_4 & -p_1p_2q_2p_4q_4 & q_1p_3q_3p_5q_5 & -q_1p_3q_3p_5q_5 \\
-p_1p_2q_2p_4q_4 & p_1p_2q_2p_4q_4 & -q_1p_3q_3p_5q_5 & q_1p_3q_3p_5q_5
\end{bmatrix}
\]
and
\[
\begin{bmatrix}
1 & -1 & -1 & 1 \\
-1 & 1 & 1 & -1
\end{bmatrix}.
\]
The difference in interaction is expressed again by the same vector as
in the previous cases. The component of $X^2$ corresponding to the first
vector of this pair is equal to:
\[
\frac{(q_2q_4x_1 - p_2q_4x_2 - q_2p_4x_3 + p_2p_4x_4 + q_3q_5x_5 - p_3q_5x_6 - q_3p_5x_7 + p_3p_5x_8)^2}
{n(p_1p_2q_2p_4q_4 + q_1p_3q_3p_5q_5)},
\]
where $x_1, \ldots, x_4$ now denote the counts in the first 2 X 2 table and
$x_5, \ldots, x_8$ those in the second. According to the end of Sec. 5, this
may be reduced to:
\[
\frac{(x_1 - np_1p_2p_4 + x_5 - nq_1p_3p_5)^2}{np_1p_2q_2p_4q_4 + nq_1p_3q_3p_5q_5}.
\]


As we know that a quantity like
\[
\frac{x_1 - np_1p_2p_4}{\sqrt{np_1p_2q_2p_4q_4}}
\]
(with n sufficiently large) has a standard normal distribution, we may
consider $x_1$ as a normally distributed variable with expectation $np_1p_2p_4$
and variance $np_1p_2q_2p_4q_4$. The component is thus the square of a
normally (0, 1) distributed combination of two normally (0, 1)
distributed variables of the considered kind and such that the numbers
$np_1$ and $nq_1$ have some weight in this combination, indeed, but not
too much. This balanced combination of statistics for testing
independence in 2 X 2 tables, which is also recommended by van Eeden [2], is
the second reason for our preference mentioned in the previous case,
which is obtained by equalizing $p_4$ to $p_5$ and $q_4$ to $q_5$ respectively. It
may be remarked that the chosen measure of dependence agrees with
Kendall's [7] quantities $\delta$ both in the case of a 2 X 4 table and in that
of two 2 X 2 tables. The common interaction is thus not obtained by
amalgamating the treated and not-treated classes, but by adding the
two values of this measure of dependence, and this as a consequence of
choosing the appropriate model.

The case where three first-order interactions must be taken into account
will be considered now. If none of the faces of the original $2^3$ table
plays a special role in this case, we must have a definition of a level
containing three interactions which is independent of the choice of the
properties allotted to rows, columns, or layers respectively. After
Bartlett [1] we choose a vector consisting of $\pi_i$ $(i = 1, \ldots, 8)$ with
\[
\sum_{i=1}^{8} \pi_i = 1 \quad\text{and}\quad \pi_1\pi_4\pi_6\pi_7 = \pi_2\pi_3\pi_5\pi_8 = \gamma.
\tag{16}
\]
We remark that this relation was also valid for the levels of the $2^3$
table, the 2 X 4 table, and the two 2 X 2 tables. Conceiving e.g.
$\pi_1/\pi_2 : \pi_3/\pi_4 = \pi_1\pi_4/\pi_2\pi_3$ as a measure of interaction in the 2 X 2
table
\[
\begin{bmatrix}
\pi_1 & \pi_2 \\
\pi_3 & \pi_4
\end{bmatrix}
\]
as is recommended for measuring linkage, we see that the relation (16)
involves equality of interactions in every pair of faces of the original
$2^3$ table. The difference between the experimental vector and the
maximum likelihood estimate of the level, which cannot be obtained
in that simple way (i.e. from marginal totals) as in the cases discussed
till now, will be called second-order interaction again. The
corresponding $X^2$ has a one-dimensional $\chi^2$-distribution because the estimation
of the level implies the estimation of six parameters.

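The estimates $\hat{\pi}_i$ are obtained by determination of the maximum
of the likelihood function:
\[
C + \sum_{i=1}^{8} x_i \log \pi_i \quad\text{under the conditions (16).}
\]
Differentiation of the function:
\[
\sum_{i=1}^{8} x_i \log \pi_i + \lambda(\pi_1\pi_4\pi_6\pi_7 - \pi_2\pi_3\pi_5\pi_8) - \mu\Big(\sum_{i=1}^{8} \pi_i - 1\Big)
\]
with respect to $\pi_i$ gives the equations:
\[
\frac{x_i + \lambda\gamma}{\pi_i} - \mu = 0, \quad (i = 1, 4, 6, 7)
\]
and
\[
\frac{x_i - \lambda\gamma}{\pi_i} - \mu = 0, \quad (i = 2, 3, 5, 8).
\]
Summing the equations, each multiplied by its $\pi_i$, yields: $\mu = n$.
Putting $\lambda\gamma = \delta$, we obtain:
\[
\begin{aligned}
x_1 + \delta &= n\hat{\pi}_1, & x_5 - \delta &= n\hat{\pi}_5, \\
x_2 - \delta &= n\hat{\pi}_2, & x_6 + \delta &= n\hat{\pi}_6, \\
x_3 - \delta &= n\hat{\pi}_3, & x_7 + \delta &= n\hat{\pi}_7, \\
x_4 + \delta &= n\hat{\pi}_4, & x_8 - \delta &= n\hat{\pi}_8.
\end{aligned}
\]
From this it will be seen that second-order interaction is a multiple
of the vector
\[
\begin{bmatrix}
1 & -1 & -1 & 1 \\
-1 & 1 & 1 & -1
\end{bmatrix}
\]
here again, namely with coefficient $\delta/n$. By substitution in (16) the
following equation for $\delta$ is obtained:
\[
(x_1 + \delta)(x_4 + \delta)(x_6 + \delta)(x_7 + \delta) = (x_2 - \delta)(x_3 - \delta)(x_5 - \delta)(x_8 - \delta).
\]
Lancaster [12] showed that the test criterion $X^2 = (\delta^2/n) \sum_{i=1}^{8} 1/\hat{\pi}_i$
calculated after solution of this cubic equation is asymptotically equal to
\[
\frac{\left(\dfrac{x_1}{\pi_1} - \dfrac{x_2}{\pi_2} - \dfrac{x_3}{\pi_3} + \dfrac{x_4}{\pi_4}
 - \dfrac{x_5}{\pi_5} + \dfrac{x_6}{\pi_6} + \dfrac{x_7}{\pi_7} - \dfrac{x_8}{\pi_8}\right)^2}
{n \sum_{i=1}^{8} 1/\pi_i}.
\tag{17}
\]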
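
As a numerical illustration, the cubic for $\delta$ can be solved by bisection. The sketch below uses the root-stock counts from Bartlett's numerical example quoted later in this section, ordered as alive (at once long, at once short, in spring long, in spring short) followed by dead in the same order. The function names are invented for this sketch, and bisection is simply one convenient choice: on the interval where all fitted counts stay positive, the left-hand product increases and the right-hand product decreases, so there is a single root. The last lines evaluate expression (17) with probabilities estimated under complete independence.

```python
PLUS = (0, 3, 5, 6)     # cells 1, 4, 6, 7 (zero-based indices)
MINUS = (1, 2, 4, 7)    # cells 2, 3, 5, 8

def solve_delta(x, tol=1e-12):
    """Root of prod(x_i + d, PLUS) = prod(x_i - d, MINUS) by bisection."""
    def f(d):
        left = right = 1.0
        for i in PLUS:
            left *= x[i] + d
        for i in MINUS:
            right *= x[i] - d
        return left - right
    lo = -float(min(x[i] for i in PLUS))    # f(lo) <= 0
    hi = float(min(x[i] for i in MINUS))    # f(hi) >= 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def expression_17(x, pi):
    """Second-order component for given cell probabilities pi."""
    n = sum(x)
    signed = sum(xi / p if i in PLUS else -xi / p
                 for i, (xi, p) in enumerate(zip(x, pi)))
    return signed ** 2 / (n * sum(1.0 / p for p in pi))

x = [156, 107, 84, 31, 84, 133, 156, 209]
n = sum(x)

delta = solve_delta(x)
fitted = [xi + delta if i in PLUS else xi - delta for i, xi in enumerate(x)]
x2_bartlett = delta ** 2 * sum(1.0 / m for m in fitted)

# Expression (17) with probabilities estimated under complete independence
p_alive = sum(x[:4]) / n
p_once = (x[0] + x[1] + x[4] + x[5]) / n
p_long = (x[0] + x[2] + x[4] + x[6]) / n
pi_indep = [a * t * l
            for a in (p_alive, 1 - p_alive)
            for t in (p_once, 1 - p_once)
            for l in (p_long, 1 - p_long)]
x2_indep = expression_17(x, pi_indep)

print(round(delta, 3), round(x2_bartlett, 2), round(x2_indep, 2))
```

Evaluating expression (17) at the fitted probabilities `[m / n for m in fitted]` returns the same value as `x2_bartlett`, which is the sense in which the two criteria coincide when the estimates of the relevant level are used.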


Now the component of $X^2$ aimed at testing second-order interaction
in the discussed models with no, one, and two interactions is equal to
the same expression (17) for any n with the restriction only that the
$\pi_i$ may stand for estimates of the true $\pi_i$ occurring in the level of the
respective model which, however, converge to the true $\pi_i$ for large n.
We see that the test criterion for second-order interaction belonging to
any of the four discussed models (on condition that the underlying
hypothesis expressed by the level is true) is asymptotically the same
as that belonging to the following level if we observe the order of our
treatment. This order implied that every model represented a stronger
assumption about the $\pi_i$ than the following models.

In other words, we may conclude that Bartlett’s test of second-order 
interaction (admitting three first-order interactions) is asymptotically 
independent of whether any of these interactions is present or not; that 
our test of second-order interaction admitting two first-order inter- 
actions is asymptotically independent of whether any of these two 
interactions is present or not, but is not valid if three interactions occur 
in fact; that our test of second-order interaction admitting one first- 
order interaction is asymptotically independent of whether this inter- 
action is present or not, but is not valid if one or two of the other true 
interactions is not zero; that the test of second-order interaction as- 
suming no interactions (i.e. Lancaster’s procedure) is only justified if 
no true interaction occurs in fact; that the value of any of the relative 
statistics is asymptotically equal to those admitting more interactions 
if only the null hypothesis belonging to the first statistic is not too
narrow in the sense that interactions are supposed zero although they 
are present. This result, an extension of Lancaster’s [12] remark, was 
the third reason for our preference in the choice of specific basis vectors. 

Finally, a remark proceeding from a consideration of Bartlett’s [1] 
numerical example also discussed by Lancaster [12]. This example 
showed a three-way classification of numbers of root-stocks according 
to time of planting (at once and in spring), to length of cutting (long 
and short), and to success (alive and dead): 


at once in spring 
long short long short 
alive 156 107 84 31 
dead 84 133 156 209 


Partition of the four-dimensional $\chi^2$ corresponding to a $2^3$ table yielded
95.58 for interaction between time of planting and success, 45.40 for
interaction between length of cutting and success, 0.00 for interaction
between length of cutting and time of planting, and 0.07 for
second-order interaction. In connection with these large interactions, Bartlett's
and Lancaster's criteria are not equivalent and they will not be expected
to be equal. If we formally follow the procedure discussed in this section,
a new model, assuming two first-order interactions, would be needed
for a further investigation of second-order interaction. We would
consider these two 2 X 2 tables:


                    alive               dead
                long    short       long    short
   at once       156     107          84     133
   in spring      84      31         156     209


The two-dimensional $\chi^2$, 7.41 (the sum of 6.50 and 0.91), could be
partitioned in 5.26 for interaction between time of planting and length of
cutting, and 2.15 for second-order interaction. Bartlett's criterion was
equal to 2.27, so that the two criteria do not differ much now. The
difference could be ascribed to a (formal) interaction between time of
planting and length of cutting.
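
The figures in this paragraph can be reproduced from the two 2 X 2 tables. The sketch below is illustrative only; it estimates the margins within each table, takes the deviation of the first cell from its expectation together with the variance term described earlier in this section, and forms the common interaction as the squared sum of the deviations over the sum of the variances. The function name is an assumption of this sketch.

```python
def first_cell_deviation(a, b, c, d):
    """Deviation of cell a from its expectation, and the matching variance
    term, for the table [[a, b], [c, d]] with margins estimated from it."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    deviation = a - row1 * col1 / n
    variance = row1 * row2 * col1 * col2 / n ** 3
    return deviation, variance

# Rows: at once, in spring; columns: long, short
d_alive, v_alive = first_cell_deviation(156, 107, 84, 31)
d_dead, v_dead = first_cell_deviation(84, 133, 156, 209)

chi2_alive = d_alive ** 2 / v_alive
chi2_dead = d_dead ** 2 / v_dead
total = chi2_alive + chi2_dead

common = (d_alive + d_dead) ** 2 / (v_alive + v_dead)
second_order = total - common

print(round(chi2_alive, 2), round(chi2_dead, 2),
      round(common, 2), round(second_order, 2))
```

The remainder `second_order` can also be written directly as `(d_alive * v_dead - d_dead * v_alive) ** 2 / (v_alive * v_dead * (v_alive + v_dead))`, which makes clear that it measures the difference between the two standardized deviations.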

But in our opinion the whole procedure (also Bartlett's) seems to be
wrong in this special example. For equality or inequality of
dependence in the two considered 2 X 2 tables has no practical sense and will
not be an object of investigation in this case. Moreover, the fact that
the number of root-stocks is equal for all treatment combinations,
which led up to an interaction $\chi^2$ exactly equal to zero, suggests that
these numbers were not random but fixed before the execution of the
experiment. An interaction between time of planting and length of
cutting must thus be excluded from the model. For that reason we
have already referred to this interaction with the term formal.

In this and in similar cases we have to consider four independent
binomial distributions defined by four chances $\pi_i$, in this example,
chances of alive according to:


                  at once                   in spring
              long       short          long       short
   alive     $\pi_1$       $\pi_2$         $\pi_3$       $\pi_4$
   dead    $1-\pi_1$     $1-\pi_2$       $1-\pi_3$     $1-\pi_4$

Here interactions are to be defined again. In the particular case where
the numbers in every column are equal (as in this example), say r, the
experimental result may be partitioned as follows:
\[
\frac{1}{r}
\begin{bmatrix}
x_1 & x_2 & x_3 & x_4 \\
x_5 & x_6 & x_7 & x_8
\end{bmatrix}
=
\begin{bmatrix}
\pi & \pi & \pi & \pi \\
1-\pi & 1-\pi & 1-\pi & 1-\pi
\end{bmatrix}
+ \alpha
\begin{bmatrix}
1 & 1 & -1 & -1 \\
-1 & -1 & 1 & 1
\end{bmatrix}
+ \beta
\begin{bmatrix}
1 & -1 & 1 & -1 \\
-1 & 1 & -1 & 1
\end{bmatrix}
+ \delta
\begin{bmatrix}
1 & -1 & -1 & 1 \\
-1 & 1 & 1 & -1
\end{bmatrix}
\]

This partition corresponds to a partition of the three-dimensional
test criterion for independence in a particular 2 X 4 table in three
independent components. The second vector on the right represents
interaction between time of planting and success, and the third vector,
interaction between length of cutting and success. The sum of the
first three vectors on the right is proportional to:
\[
\begin{bmatrix}
\pi_1 & \pi_2 & \pi_3 & \pi_4 \\
1-\pi_1 & 1-\pi_2 & 1-\pi_3 & 1-\pi_4
\end{bmatrix}
\]
i.e. a vector where $\pi_1 - \pi_2 = \pi_3 - \pi_4$, $\pi_1 - \pi_3 = \pi_2 - \pi_4$ and similar
relations between $1 - \pi_i$ are valid. Such a vector where the (positive
or negative) raising of the chance of alive or of dead by long
cutting is the same for both times of planting, and where this raising
by planting at once is the same for both lengths of cutting, seems to be
a natural definition of the hypothesis no second-order interaction for
this case. The fourth vector will represent second-order interaction,
i.e. inequality of the mentioned raisings. The corresponding partition of
$X^2$ is for this example numerically equivalent to that at the beginning
of our remark about it. If the numbers in the columns are not equal,
a first-order interaction, e.g. between time of planting and success,
can be tested in the 2 X 2 table obtained by neglecting the other
classification (length of cutting).

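A test for second-order interaction will imply then (and also when
the partition described shows considerable first-order interactions as
in this example) a maximum likelihood estimation of the $\pi_i$ under the
hypothesis of "no second-order interaction," i.e. $\pi_1 + \pi_4 = \pi_2 + \pi_3$.
To that end we determine the maximum of the likelihood function:
\[
C + \sum_{i=1}^{4} x_i \log \pi_i + \sum_{i=1}^{4} (n_i - x_i) \log (1 - \pi_i) \quad\text{under that condition.}
\]
Differentiating the function:
\[
\sum_{i=1}^{4} x_i \log \pi_i + \sum_{i=1}^{4} (n_i - x_i) \log (1 - \pi_i) + \lambda(\pi_1 - \pi_2 - \pi_3 + \pi_4)
\]
with respect to $\pi_i$ yields four quadratic equations in $\pi_1$, $\pi_2$, $\pi_3$, and
$\pi_4$ respectively. The usable solutions ($0 < \pi_i < 1$) are:
\begin{align*}
\hat{\pi}_1 &= \frac{\lambda - n_1 + \sqrt{(n_1 - \lambda)^2 + 4\lambda x_1}}{2\lambda}, \\
\hat{\pi}_2 &= \frac{\lambda + n_2 - \sqrt{(n_2 + \lambda)^2 - 4\lambda x_2}}{2\lambda}, \\
\hat{\pi}_3 &= \frac{\lambda + n_3 - \sqrt{(n_3 + \lambda)^2 - 4\lambda x_3}}{2\lambda}, \\
\hat{\pi}_4 &= \frac{\lambda - n_4 + \sqrt{(n_4 - \lambda)^2 + 4\lambda x_4}}{2\lambda}.
\end{align*}
Substitution in the relation between the $\pi_i$ yields the following equation
for $\lambda$:
\[
\sqrt{(n_1 - \lambda)^2 + 4\lambda x_1} + \sqrt{(n_2 + \lambda)^2 - 4\lambda x_2}
+ \sqrt{(n_3 + \lambda)^2 - 4\lambda x_3} + \sqrt{(n_4 - \lambda)^2 + 4\lambda x_4}
= \sum_{i=1}^{4} n_i
\]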

from which a solution different from the trivial solution $\lambda = 0$ is required,
unless an approximate solution which may be given by


and which can be improved by usual methods, is exactly zero. In
that case the solution for $\lambda$ is zero.

In Bartlett's example this approximate solution of $\lambda$ was equal to
4.91. This could be improved to 4.92078. Solving the $\hat{\pi}_i$ with the help
of this value gave the following table of expected numbers:


   157.11    105.78     82.89     31.56
    82.89    134.22    157.11    208.44


The one-dimensional $\chi^2$ for second-order interaction according to our
definition of no second-order interaction appeared to be equal to 0.082.
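
The computation for Bartlett's example can be sketched as follows. This is a hedged illustration, not the authors' procedure: instead of the approximate starting value mentioned above, it brackets $\lambda$ in a fixed interval (an assumption that suits these data) and bisects on the constraint $\pi_1 - \pi_2 - \pi_3 + \pi_4 = 0$. The root of each quadratic is written in an algebraically equivalent form that stays finite at $\lambda = 0$.

```python
import math

SIGNS = (1, -1, -1, 1)   # sign with which each chance enters the constraint

def fitted_chance(x, n, t):
    """Root in (0, 1) of t*pi**2 + (n - t)*pi - x = 0.
    Equivalent to (t - n + sqrt((n - t)**2 + 4*t*x)) / (2*t), but stable
    near t = 0, where it reduces to x / n."""
    return 2.0 * x / (n - t + math.sqrt((n - t) ** 2 + 4.0 * t * x))

def constraint(lam, alive, sizes):
    """pi_1 - pi_2 - pi_3 + pi_4 at the fitted chances; increasing in lam."""
    return sum(s * fitted_chance(x, n, s * lam)
               for s, x, n in zip(SIGNS, alive, sizes))

def solve_lambda(alive, sizes, lo=-1000.0, hi=1000.0, tol=1e-10):
    """Bisection; the bracket [lo, hi] is an assumption of this sketch."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if constraint(mid, alive, sizes) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

alive = [156, 107, 84, 31]          # at once long/short, in spring long/short
sizes = [240, 240, 240, 240]        # root-stocks per treatment combination

lam = solve_lambda(alive, sizes)
expected_alive = [n * fitted_chance(x, n, s * lam)
                  for s, x, n in zip(SIGNS, alive, sizes)]
expected_dead = [n - e for n, e in zip(sizes, expected_alive)]

# Alive and dead cells of a column deviate by the same amount, opposite sign
chi2 = sum((x - e) ** 2 * (1.0 / e + 1.0 / (n - e))
           for x, e, n in zip(alive, expected_alive, sizes))

print(round(lam, 4), [round(e, 2) for e in expected_alive], round(chi2, 3))
```

Because each column size is fixed, the expected numbers of dead follow by subtraction from 240, which is how the fourth entry of the first row of the table above is recovered.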


1. Introduction


The validity of the hemacytometer method for estimating the 
density of erythrocytes depends upon several factors. Among these, 
it is necessary to assume that the aliquot of blood taken from the 
organism is truly representative of the total blood mass. In addition, 
it must be assumed that the erythrocytes in the aliquot become 
dispersed completely at random over the surface of the counting 
chamber. The manner in which the erythrocytes randomly distribute 
themselves into the various square divisions of the chamber grid becomes 
of interest by virtue of the fact that knowledge of this distribution can 
yield information concerning the precision of the estimated number of 
cells. Abbé [1878] seems to be the first to suggest that there is a relation 
between precision and the magnitude of the count itself. He proposed 
using as a measure of precision the square root of the mean number of 
cells per division as a consequence of the distribution law which he 
derived. This distribution law was in fact identical to the well-known 
Poisson distribution [1837]. Student [1907] independently derived 
Poisson’s distribution in regard to counting yeast cells. 

It is well known that the mean number of cells per division is a
good estimate, in fact the maximum likelihood estimate, of the variance
of a Poisson distribution; so that the standard deviation might simply
be taken as the square root of the mean as Abbé proposed. However,
in more recent years, Berkson [1938] has calculated the mean and
variance of a number of cases of actual red blood cell counts made
under very excellent counting conditions and has shown that the variance
is almost invariably smaller than the mean. Berkson tested the equality
of the mean and variance, implied by the Poisson distribution, on these
data by the use of Fisher's [1922] chi-square test.

Lancaster [1952] investigated the use of this chi-square criterion 
for controlling counting and found it quite adequate to the task. 
Berkson and his associates discuss the adequacy of the Poisson model 
in a number of papers [1935, 1936, 1938, 1939, and 1940]. Whitaker 
[1914] and Student himself [1919] both give illustrations of the inade- 
quacy of the Poisson in some particular instances and show that in 
many of the common applications of the distribution the underlying 
assumptions are not even approximately met. 

Lancaster [1950] attacked the problem of crowding of the erythro- 
cytes which had been noted by Berkson, et al. [1935]. He derived the 
consequences of crowding upon the relationship of the variance to the 
mean but did not offer a complete distribution law to substitute for the 
Poisson distribution. 

Plum [1936] should be consulted for the early history of the subject 
and Lancaster for later developments [1950]. 


2. The Poisson-Binomial Model


It is a well known fact that the Poisson distribution can be con- 
sidered to be a limiting form of the binomial distribution. This latter 
distribution can be derived for the hemacytometer situation in the 
following manner. 

Let us suppose that there are a total of r blood cells which finally 
come to rest under n squares of the hemacytometer grid. We would 
like to know in what manner these r cells will distribute themselves into 
the squares of the grid. In other words, what is the probability that 
there will be no cells in a given square; what is the probability that 
there will be one cell in the given square; or two cells in the given square; 
or three cells; or in general k cells in the given square? Now if we
assume that all of the r cells are distinguishable and that all $n^r$ possible
arrangements are equally likely to occur, then the probability of
obtaining exactly k cells in a given square can be found.

It is seen that this probability P(k) is simply the


   No. of arrangements providing k cells in the given square
   ---------------------------------------------------------
              Total no. of possible arrangements


Now, there are $\binom{r}{k}$ ways of selecting k cells out of r cells and for each
of these ways there are $(n - 1)^{r-k}$ ways of arranging the remaining
r − k cells into the remaining n − 1 squares. Thus,
\[
P(k) = \frac{\binom{r}{k}(n - 1)^{r-k}}{n^r}.
\tag{1}
\]
This is the general term of the series for the binomial $[1/n + (n - 1)/n]^r$.

In the hemacytometer situation n = 400 if we confine our attention
to the smallest subdivisions in one large square of the standard
hemacytometer. Each of these smallest squares has an area of 1/400th of a
square mm. The total number of cells under the grid, r, is usually of
the order of one to three thousand. With values of n and r in this range
it can be easily demonstrated that the general term of the Poisson
distribution
\[
\frac{1}{k!}\, e^{-r/n} \left(\frac{r}{n}\right)^{k}
\tag{2}
\]
can be used as a quite good approximation for P(k). Now it is well 
known that the variance of the binomial distribution is rpq which in 
the notation of this paper becomes 


$$r \cdot \frac{1}{n} \cdot \frac{n - 1}{n},$$


but this is, to a close approximation when n = 400 just r/n, which is, 
in fact, the mean. It is also true, as previously mentioned, that the 
variance of the Poisson distribution is equal to the mean. Thus, it
can be seen that if this binomial-Poisson model were appropriate we 
should expect the estimated mean and the estimated variance to be of 
equal magnitude on the average. This is not the case, as Berkson 
showed. The difficulty can be traced to the assumptions utilized in 
deriving the model. We assumed that all $n^r$ possible arrangements are
equally likely. This implies that there is no crowding; that is, if a cell
occupies a square, another cell is just as likely to come to rest
in that square as in any other square, one, say,
that has not been previously occupied. Now, this is a reasonable 
assumption so long as a square is not nearly filled with cells. However, 
with the range of r usually encountered in red cell hemacytometry 
and erythrocytes being as big as they are, it happens that a number of 
squares get quickly filled and this results in a more even dispersion of 
cells throughout the counting chamber. This, obviously, results in a
smaller observed variance than would be predicted by the
binomial-Poisson model.
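
The closeness of the binomial term (1) to the Poisson term (2) can be checked numerically. The following Python sketch is an illustration only; the function names and the choice r = 2000 (a value inside the stated range of one to three thousand) are assumptions made here and do not come from the paper.

```python
from math import comb, exp, factorial

def binomial_pk(k, r, n):
    """Formula (1): chance that exactly k of r cells fall in one given square of n."""
    return comb(r, k) * (n - 1) ** (r - k) / n ** r

def poisson_pk(k, r, n):
    """Formula (2): Poisson term with mean r/n."""
    mean = r / n
    return mean ** k * exp(-mean) / factorial(k)

def binomial_variance(r, n):
    """Variance rpq with p = 1/n and q = (n - 1)/n."""
    return r * (1 / n) * ((n - 1) / n)

n, r = 400, 2000  # r is an assumed, illustrative total count
for k in range(0, 11):
    print(k, round(binomial_pk(k, r, n), 5), round(poisson_pk(k, r, n), 5))
print("mean:", r / n, "binomial variance:", binomial_variance(r, n))
```

Under this model the binomial variance lies only slightly below the mean r/n, so neither form can account for an observed variance that is markedly smaller than the mean.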
3. An Improved Model


We will discuss in this paper a model which takes into account the 
above-mentioned crowding due to the finite limitations of space in the 
counting chamber.* A derivation can be made as follows. 

Let us suppose that N is the maximum number of cells that can be
accommodated under the entire grid, assuming no overlapping of cells.
We denote the maximum number of cells which can be accommodated
under one square of the grid as a. Obviously a = N/n since each of
the n squares has equal area. We desire the probability,
$P(k_1, k_2, \ldots, k_n)$, of obtaining exactly $k_1, k_2, \ldots, k_n$
cells in the n squares of the grid.

Now, the total number of ways in which the r cells can be arranged
in the N available spaces is $\binom{N}{r}$, since each space can contain only one
cell. We assume that each of these ways has an equal chance of occurring.
The number of ways that $k_1$ cells can be arranged in the a spaces
available in the first square, $k_2$ cells in the second, and so on is

$$\binom{a}{k_1}\binom{a}{k_2}\cdots\binom{a}{k_n}.$$

Then, the desired probability of obtaining exactly $k_1, k_2, \ldots, k_n$
cells in the n squares is given by

$$P(k_1, k_2, \ldots, k_n) = \frac{\binom{a}{k_1}\binom{a}{k_2}\cdots\binom{a}{k_n}}{\binom{N}{r}}, \tag{4}$$

where the sum of the k's is equal to r.

It is tacitly implied by this derivation that all of the r cells are of 
the same size. Of course, erythrocytes, like all biological products, 
vary somewhat in size; however, it seems likely that the model (4) 
will still serve as a reasonably good approximate model. The adequacy 
of the approximation, as in all scientific models, must be determined 


by the ability of the model to describe approximately all relevant 
details of the observational record. 
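
As a small check on model (4), the probabilities of all admissible arrangements should add to one. The sketch below enumerates a toy grid in Python with exact fractions; the grid dimensions (3 squares of 2 spaces each, 3 cells) and the function name are hypothetical and chosen only to keep the enumeration short.

```python
from fractions import Fraction
from itertools import product
from math import comb, prod

def joint_probability(counts, a):
    """Model (4): probability of the given counts per square, each square having a spaces."""
    n = len(counts)
    N = n * a          # total number of spaces under the grid
    r = sum(counts)    # total number of cells
    return Fraction(prod(comb(a, k) for k in counts), comb(N, r))

# Hypothetical toy grid: n = 3 squares, a = 2 spaces per square, r = 3 cells.
n, a, r = 3, 2, 3
outcomes = [c for c in product(range(a + 1), repeat=n) if sum(c) == r]
total = sum(joint_probability(c, a) for c in outcomes)
print(len(outcomes), "arrangements of counts, total probability", total)
```

Because no square can hold more than a cells, arrangements that pile many cells into one square are excluded, which is how the model expresses crowding.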


*We are obligated to Dr. E. J. Williams for suggesting the model discussed here. The authors
had employed the hypergeometric distribution given by

$$P(k) = \binom{a}{k}\binom{N - a}{r - k} \bigg/ \binom{N}{r}$$

in an earlier draft of this paper. This latter was not so general in that the correlation between squares,
which had been deemed negligible, was not taken into account. Formula (5) below was also provided
by Dr. Williams.

It can be shown that the expected value of the estimated variance
of $k_i$ ($i = 1, 2, \ldots, n$) is given by

$$\sigma^2 = \frac{N}{N - 1}\,\mu - \frac{n}{N - 1}\,\mu^2, \tag{5}$$

where $\mu$ is the expected mean, r/n. When N is large, the coefficient
of $\mu$ in the first term of formula (5) becomes close to unity so that the
variance can be approximately expressed by

$$\sigma^2 \approx \mu - \frac{n}{N - 1}\,\mu^2. \tag{6}$$

This is to be compared with the corresponding expression for the Poisson
distribution,

$$\sigma^2 = \mu. \tag{7}$$


Formula (5) should be compared with the quadratic relationship derived 
by Lancaster (1950) by a non-distributional argument. In his approach 
to the crowding problem Lancaster was working with larger subdivisions 
of the hemacytometer grid than we have employed and his derivation 
was geometrical in nature. It is interesting that these two independent 
developments have produced theoretical relationships between the 
mean and variance of the same algebraic form. 
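
The quadratic relationship between mean and variance can be verified on a small case. The Python sketch below is illustrative: it codes the quadratic form N/(N - 1) times the mean less n/(N - 1) times the squared mean, and compares it with an exact calculation from the hypergeometric marginal distribution of a single square under model (4). The toy dimensions and function names are assumptions made here.

```python
from fractions import Fraction
from math import comb

def expected_variance(mu, n, N):
    """Quadratic mean-variance relationship: N/(N-1)*mu - n/(N-1)*mu**2."""
    return Fraction(N, N - 1) * mu - Fraction(n, N - 1) * mu ** 2

def exact_expected_variance(r, n, a):
    """n/(n-1) times the exact variance of one square's count under model (4)."""
    N = n * a
    lo, hi = max(0, r - (N - a)), min(a, r)
    pmf = {k: Fraction(comb(a, k) * comb(N - a, r - k), comb(N, r))
           for k in range(lo, hi + 1)}
    mean = sum(k * p for k, p in pmf.items())
    var = sum((k - mean) ** 2 * p for k, p in pmf.items())
    # The sample mean is fixed at r/n, so E[sum of squared deviations] = n * var.
    return Fraction(n, n - 1) * var

# Hypothetical toy case: n = 4 squares, a = 3 spaces each (N = 12), r = 5 cells.
n, a, r = 4, 3, 5
mu = Fraction(r, n)
print(expected_variance(mu, n, n * a), exact_expected_variance(r, n, a), "Poisson:", mu)
```

In this toy case the two calculations agree exactly, and both fall below the Poisson value, which equals the mean.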


4. Test of the Model 


Formula (6) provides a convenient way of testing the adequacy of
the suggested model. We do not know the value of N but the constant
n/(N - 1) can be estimated from actual data by fitting the quadratic
equation (6). Figure 1 shows the data of Berkson [1935] with the best
fitting (in the sense of least squares) quadratic curve drawn through
the points. The straight line drawn at 45° from the axes represents
the relationship between the mean and variance expected on the basis
of the Poisson model. The description of the data seems to be quite
good.

FIGURE 1
RELATIONSHIP BETWEEN THE VARIANCE AND
THE MEAN CELL COUNT (DATA OF BERKSON)
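
The fitting of equation (6) can be sketched as a one-parameter least-squares problem. The paper does not state the exact fitting procedure, so the form used below (minimizing the squared deviations of the observed variances from mean minus c times squared mean, with c = n/(N - 1)) is an assumption, as are the function names. The data are synthetic, generated from a known constant, and are not Berkson's observations.

```python
def fit_crowding_constant(means, variances):
    """Least-squares c in: variance = mean - c * mean**2, where c = n/(N - 1)."""
    num = sum(m ** 2 * (m - v) for m, v in zip(means, variances))
    den = sum(m ** 4 for m in means)
    return num / den

def N_from_constant(c, n):
    """Invert c = n/(N - 1) to recover N."""
    return 1.0 + n / c

# Synthetic, hypothetical data generated from a known constant.
means = [2.0, 3.0, 4.0, 5.0, 6.0]
c_true = 0.02
variances = [m - c_true * m ** 2 for m in means]

c_hat = fit_crowding_constant(means, variances)
print("fitted constant:", c_hat, "implied N for n = 400:", N_from_constant(c_hat, 400))
```

With real counts the observed variances scatter about the curve, and the fitted constant then summarizes how far the data fall below the 45° Poisson line.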

A similar series of observations was made at Duke University
under conditions rivaling those of Berkson in accuracy of counting.
These data are presented in Figure 2, along with the Poisson line and
the quadratic relationship for the more general model. Again, the
description appears to be not at all bad.

FIGURE 2
RELATIONSHIP BETWEEN THE VARIANCE AND
THE MEAN CELL COUNT (DUKE SERIES #1)

Both the series of Berkson and the Duke series represent observations
on samples of blood from different individuals. In Figure 3
is presented a series of observations made at Duke collected in a different
manner. Each mean and variance is based on counts of erythrocytes
for the same blood sample. The mean number of cells per square has
been varied by using various degrees of dilution of the single sample
of blood. The dilutions used were 1:250, 1:225, 1:200, 1:175, and 1:150.
Two preparations were made at each of these dilutions. Each of the
ten resulting means and variances is based upon 208 of the small squares
instead of 400 as in the two previously described series. The description
by the quadratic curve would be quite good if it were not for the two
high values. Perhaps these two values represent just random variation,
but we are inclined to think that their departure from expectation may
be due to disturbances in the chamber filling process. As we intend to
show elsewhere, the variance of the hemacytometer count can be
increased to a considerable extent by interruption in the filling of the
counting chamber so that a homogeneous distribution of cells in the
chamber does not result.

FIGURE 3
RELATIONSHIP BETWEEN THE VARIANCE AND
THE MEAN CELL COUNT (DUKE SERIES #2)


5. Size of the Erythrocytes 


We have attempted to test the tenability of the new model by 
comparing observation with expectation as regards one particular 
consequence of this model, namely, the expected quadratic relationship 
between the variance and the mean. We propose the conclusion that 
the results are not, on the whole, at all inconsistent with the model. 
It certainly appears that the new model provides a better description 
of the presented data than the Poisson model. There is another conse- 
quence of the suggested model which can easily be put to a test. We 
have mentioned the fact that it is the effect of crowding which diminishes 
the variability of the number of cells per square. Now, crowding can 
be produced in two different ways. We can either increase the number 
of cells which compete for the available spaces or we can increase the 
size of the cells. The data previously discussed demonstrate the effect 
of crowding by increasing the number of cells. We now consider the 
alternative situation, the effect of cell size. 

First, we note that it is possible to estimate N directly by use of
the following relationship which is derived from formula (5) by
rearranging terms:

$$N = \frac{n\mu^2 - \sigma^2}{\mu - \sigma^2}, \tag{8}$$

where we substitute the observed mean and variance for $\mu$ and $\sigma^2$. This
is, of course, the “moment” estimate and is, perhaps, not the most
efficient estimate of N. However, since the maximum likelihood
estimator is considerably more difficult to apply, we will use this simple
estimator to illustrate our point. Now, N is by definition the total 
number of available spaces, so that it is related inversely to the average 
cell size. If we know that we have a sample of blood containing large 
cells we should expect our estimate of N to be smaller than that for 
normal sized cells. The reverse, of course, should hold true if we have 
a sample containing smaller than average cells. Failure of the results
to conform to this expectation would throw some suspicion on the
adequacy of the model.
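
The moment estimate can be illustrated by a round trip: compute the variance implied by a known N from the quadratic mean-variance relationship, then recover N by the rearranged form. The Python sketch below uses exact fractions; the function names and the values (N = 20,000 spaces, mean of 5 cells per square) are hypothetical.

```python
from fractions import Fraction

def variance_from_N(mu, n, N):
    """Formula (5): N/(N-1)*mu - n/(N-1)*mu**2."""
    return Fraction(N, N - 1) * mu - Fraction(n, N - 1) * mu ** 2

def estimate_N(mean, variance, n):
    """Rearrangement of formula (5) for N: (n*mean**2 - variance)/(mean - variance)."""
    return (n * mean ** 2 - variance) / (mean - variance)

n, N_true = 400, 20000          # hypothetical number of spaces under the grid
mu = Fraction(5)                # hypothetical mean count per square
sigma2 = variance_from_N(mu, n, N_true)
print("implied variance:", float(sigma2), "recovered N:", estimate_N(mu, sigma2, n))
```

Since the denominator is the small difference between mean and variance, the estimate is sensitive to sampling error in the observed variance, which is consistent with the remark that it may not be the most efficient estimate.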

A sample of blood was obtained from a patient with microcytic 
anemia and from one with macrocytic anemia. The number of cells 
per square was counted for each of the 400 small squares and the mean 
and variance were calculated. These statistics were obtained for both 
patients. N, the number of available spaces, was estimated by formula 
(8) and the results are given in Table 1.

TABLE 1

Patient      No. of squares   Mean No.   Variance   Estimated   Cell diameter   Mean corpuscular
             counted          of cells              N           (in microns)    volume
Microcytic   400              5.548      5.045      25,100      6.3             65
Macrocytic   400              3.420      3.242      14,900      8.2             104

The average cell diameter can be approximated from N (and hence from the mean and variance)
by assuming that the cells each occupy a square with sides equal to
the diameter of the cell. Since the 400 small squares together cover
1 square mm, or $10^6$ square microns, the cell diameter in microns is
approximately given by

$$\text{Cell diameter} \approx 1000/\sqrt{N}. \tag{9}$$


The mean corpuscular volume has been computed from the hematocrit 
and cell count for comparative purposes. We are not proposing calcu- 
lation of the cell diameter from the mean and variance as a practical 
procedure; but it appears to us to yield an interesting confirmation of 
the essential validity of the model. The actual average cell diameters 
are not known but the values computed above are of a reasonable 
magnitude for patients of these two types. 
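
The diameters in Table 1 follow directly from the tabulated values of N by formula (9). A minimal Python sketch is given below; the function name and the grid_area_mm2 parameter are introduced here for illustration, with the grid area of 1 square mm taken from the description of the 400 small squares.

```python
from math import sqrt

def cell_diameter_microns(N, grid_area_mm2=1.0):
    """Formula (9): side of the square occupied by each of N cells under the grid."""
    grid_side_microns = 1000.0 * sqrt(grid_area_mm2)  # 1 mm = 1000 microns
    return grid_side_microns / sqrt(N)

for patient, N in (("Microcytic", 25100), ("Macrocytic", 14900)):
    print(patient, round(cell_diameter_microns(N), 1))
```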


6. Discussion 


The combined effect on the variance of crowding by increasing the
size of the erythrocytes, which is equivalent to decreasing N, and of
increasing the density of erythrocytes in the counting chamber, may be
seen from Figure 4. The range of values, both as to numbers and size,
is considerably larger than is encountered in actual practice. However,
this extension of the range makes possible a portrayal of the full
implications of the suggested model. For example, it is apparent that as
the density of cells is increased and/or the size of the cells is increased,
the variance of the count increases to a point, and then with further
increase of density or size, the variance declines and eventually becomes
zero when all of the chamber squares are filled to a maximum and hence
all contain the same number of erythrocytes. The line corresponding
to N = ∞ is the degenerate case, the Poisson distribution. When N
is infinite, then the cells must occupy no spaces at all, and hence no
crowding occurs. Thus, by considering the Poisson as a limiting case
of the suggested distribution, we realize why this simple distribution
does not suffice to describe the distribution of such relatively large
objects as erythrocytes when in considerable density.

FIGURE 4
RELATIONSHIP BETWEEN THE VARIANCE AND MEAN
CELL COUNT FOR A VARIETY OF HYPOTHETICAL CASES
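
The rise and fall of the variance described above can be traced numerically. Writing the quadratic relationship (5) in the equivalent factored form, mean times (N - n times mean) divided by (N - 1), shows that the variance vanishes when the mean reaches N/n and is greatest at half that value. The Python sketch below tabulates the curve for one hypothetical case; the values of n and N and the function name are assumptions made for illustration.

```python
def crowded_variance(mu, n, N):
    """Formula (5) in factored form: mu * (N - n*mu) / (N - 1)."""
    return mu * (N - n * mu) / (N - 1)

n, N = 400, 8000            # hypothetical: a = N/n = 20 spaces per square
a = N // n
curve = [(mu, crowded_variance(mu, n, N)) for mu in range(0, a + 1)]
peak_mu = max(curve, key=lambda point: point[1])[0]

for mu, var in curve:
    print(mu, round(var, 3))
print("variance is greatest at mean", peak_mu, "and zero at mean", a)
```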

The reader may wonder why we have chosen to test the theory by 
such indirect measures as those we have described. Why not simply 
test the goodness of fit of the theoretical distribution with the observed 
distribution? This was actually done, but it turned out to be a quite 
insensitive mode of discrimination. It happens that several different 
distributions, including both the Poisson and the suggested distribu- 
tion, yield fits to the data which are all quite good by the chi-square 
goodness of fit criterion. However, it is not the overall fit, but rather 
the precision of the count with which we are primarily concerned. It 
is this specific aspect of the description wherein the new model appears 
to be an improvement over the Poisson. 

We will terminate this discussion by a few comments as to the 
practical implications of the above theory. We were motivated in this
investigation by the desirability of being able, in actual practice, to
place confidence limits on cell counts without having actually to bother 
with making a separate estimate of the variance. By assuming that the 
cells were distributed in the counting chamber according to the Poisson 
Law, we were able to do just that. In this case the variance is taken to 
be the count itself. Now, if instead we assume that the law (4) holds, 
then it is not possible to know the variance from the count alone. In 
addition, we must know the size of the erythrocytes; however, this 
information is in general not known. Fortunately, there is a very 
simple solution to this dilemma, one that is apparent from the graphs 
of the relationship between the variance and mean. If the cell
suspension is sufficiently diluted, then the limiting Poisson distribution
becomes an adequate approximation to the true distribution and the 
variance of the count can be taken to be the count itself. 

Another solution might be considered, but turns out to be of no 
value for our purposes. This solution would be to increase the density 
of cells to the point that the variance approaches zero. Certainly, we 
should desire the variance to be as small as possible. However, the 
effect of this procedure is to destroy the representativeness of the 
sample in that there would be a piling-up of cells and thus our necessary 
condition of randomness would be lost. 


7. Summary 


Thus, we finally arrive back where we started via a rather devious 
route. We began with the empirical evidence against the use of the 
Poisson distribution. Then we derived a new model with somewhat 
less unrealistic assumptions about the physical system we were studying. 
We saw that this latter model fairly well represented a variety of actual 
data. Then we considered from an intuitive point of view the known 
fact that the Poisson is a limiting form of this latter distribution. We 
then concluded that the Poisson can be made to apply to red cell 
hemacytometry by making greater dilutions of the cells than is at present 
generally done. This conclusion is justified in two different ways. 
First, with sufficient dilution we can use the Poisson distribution; and, 
second, it seems that there is less chance of disturbing the randomness 
of the sample by eliminating a tendency to pile-up.


THE COMPARISON OF THE SENSITIVITIES OF SIMILAR
EXPERIMENTS: APPLICATIONS* 


R. A. BRADLEY AND D. E. W. SCHUMANN**


Virginia Agricultural Experiment Station of the Virginia Polytechnic Institute, 
Blacksburg, Virginia, U.S. A. 


INTRODUCTION 


The problem of deciding on the best method of measuring the 
effects of experimental treatments is a difficult one in certain areas 
of research. We are concerned with means of comparing the sensi- 
tivities of two available experimental techniques, where we use the 
term in a very general sense, that might be used to yield data for 
specified treatment comparisons. It is often not possible to determine 
on a priori grounds which of two techniques should be used and it may 
be desirable to use both techniques in separate experiments with a 
view to deciding on the technique to be used in future experimentation. 
When the two experiments are indeed independent of each other in 
probability, the methods of this paper may be used. 

Let us consider several typical situations indicated by subject- 
matter headings. 

Taste Testing: Hopkins [1950, 1953] investigated the use of discrete 
scoring scales in taste testing experiments as they are commonly used 
and Baten [1946] suggested the use of a continuous line scoring system. 
The latter obtained results which he interpreted as showing an increase 
in sensitivity for his method over the use of discrete scores. Procedures 
discussed in this paper provide means of comparing the sensitivities 
of the two scoring systems if data were available from two identically 
designed but statistically independent experiments, one for each 
scoring system, in a form suitable for analysis of variance. 

Many other comparable situations arise in taste testing. The 
sensitivities of two taste panels, two methods of preparation of test 
items, two environmental situations during tasting, or two methods of 
storage may be compared. 


*Research sponsored by the Agricultural Research Service, U.S.D.A. under a Research and 
Marketing Act Contract, No. 12-14-100-126(20). 


**Now with the Department of Statistics, University of Stellenbosch, Stellenbosch, South Africa.

Chemistry: Consider the analytical chemist with two series of
reactions based on the use of different reagents available for a quantitative
analysis of a particular element in a compound. His ultimate objective 
is to run comparative experiments on the concentrations of this element 
in samples of the compound obtained from a number of suppliers. Two 
similar experiments might be conducted, one for each analytical method,
to determine which method is to be used in the required routine experi- 
ments. Other problems of this nature arise frequently in chemistry 
and all of the physical sciences. 

Life Testing: Means of conducting accelerated life tests are being 
devised in electronics, food processing, nuclear physics, and in a host 
of other research endeavors. In some cases it may be possible to 
compare experiments based on accelerated tests with experiments 
depending on normal life tests. In other cases alternative means of 
conducting accelerated tests may be available and comparisons of their 
sensitivities in the sense of emphasizing real treatment differences are 
required. 

Agronomy: One of the difficult areas of agronomic research is in 
pasture experimentation. Yields of various forage mixtures may be 
measured through clipping of plots in some experiments while in others 
animal performance or carrying capacity may be taken as the measure 
of yield. While care should be taken to determine that these yield 
measurements may be transformed to comparable units, the need for 
comparison of the sensitivities of techniques relative to the exhibition 
of differences between measurable forage characteristics is again apparent. 

Cochran [1943] discussed the problem of comparing different scales 
of measurement for experimental results. He assumed that analysis 
of variance techniques were applicable and confined his attention to the 
case in which all scales measure the same experiment. It was noted 
that a comparison of the sensitivities of two scales should depend both 
on the experimental errors associated with them and on the magnitudes 
of the treatment effects in the scales. Cochran suggested that a com- 
parison of this sort should depend on a test of significance of the hypoth- 
esis that the parameters of two noncentral variance-ratio distributions 
are equal. Schumann and Bradley [1957] have provided means of making 
such tests of significance for comparisons of the sensitivities of two 
identically designed independent experiments. The main results are 
then for identical experiments in the sense that both experiments yield 
F-tests with the same degrees of freedom and with treatment means 
based on the same number of replications although generalizations are 
indicated. We shall summarize that work in the next section and devote 
the rest of this paper to numerical illustrations.

We may have somewhat generalized Cochran’s concept of comparing
scales of measurement in discussing comparisons of experimental 
techniques. Now some caution is in order in interpreting the results 
on comparisons of experiments. The test procedures developed are 
correct whenever two independent variance ratios are compared. How- 
ever, the experimental techniques under comparison should be such 
that they do not interact with the treatments under comparison in a 
way not explainable in terms of a shift in treatment locations and a 
magnification of differences in treatment effects. It would be a good 
rule to forego experiment sensitivity comparisons if there is evidence 
of treatment by experiment interaction of a nature other than that 
permitted in view of our preceding statement. 

Many experiments are conducted with a view to examining inter- 
actions. To illustrate, suppose that two independent experiments in 
agriculture differ only in geographical location. Then it is more likely 
that the experimenter will be interested in treatment by location 
interactions than in location sensitivity differences to treatment effects. 
The important applications of our methods seem to be in comparing 
experiments based on experimental techniques differing essentially in 
means of measurement which do not interact fundamentally with the 
treatments under comparison. It is of course obvious, if two experi- 
mental techniques are to be compared, that the comparison should 
not be confounded with other experimental changes. 

The method for the comparison of the sensitivities of experiments 
may also be used to compare two population multiple correlation 
coefficients whose sampling distributions are based on regression 
models. An example of this method will be given.


DISTRIBUTION THEORY 


The following summary of the distribution theory for the comparison 
of the sensitivities of two experiments is based on the work of Schumann 
[1956] and Schumann and Bradley [1957].

Let $F_1$ and $F_2$ be independent noncentral variance ratios, each
with 2a and 2b degrees of freedom and with parameters of noncentrality,
$\lambda_1$ and $\lambda_2$. Then their distributions are of the form

$$f(F_i) = (a/b)^a [B(a, b)]^{-1} e^{-\lambda_i} F_i^{a-1} (1 + aF_i/b)^{-(a+b)}\, {}_1F_1\!\left[a + b,\ a,\ \frac{a\lambda_i F_i}{b(1 + aF_i/b)}\right], \quad 0 \le F_i < \infty, \quad i = 1, 2, \tag{1}$$

where B represents the beta function and ${}_1F_1$, the confluent
hypergeometric series. In Model I of the analysis of variance with so-called
“fixed” parameters in the additive model, if $\tau_{ij}$ is the effect of the j-th
of t treatments in experiment i and if $\sigma_i^2$ is the i-th population
experimental error variance,

$$\lambda_i = k \sum_{j=1}^{t} \tau_{ij}^2 / 2\sigma_i^2, \quad i = 1, 2, \tag{2}$$


where k is the number of observations in each treatment mean. These
parameters of noncentrality are then measures dependent upon both
the experimental error associated with the experiment and the
magnitudes of treatment effects in the experiment. When $\lambda_i = 0$, $f(F_i)$ in
(1) becomes the density function of a central variance ratio with 2a
and 2b degrees of freedom. The specification of degrees of freedom
as 2a (for the numerator of $F_i$) and 2b (for the denominator of $F_i$)
was for mathematical convenience; a and b may be integers or
half-integers.
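
The way in which the parameter of noncentrality in (2) combines treatment effects and error variance may be seen from a short calculation. The Python sketch below is illustrative only; the treatment effects, error variance and number of observations per mean are hypothetical, and the function name is introduced here.

```python
def noncentrality(effects, error_variance, k):
    """Equation (2): k * (sum of squared treatment effects) / (2 * error variance)."""
    return k * sum(t ** 2 for t in effects) / (2.0 * error_variance)

effects = [-2.0, 0.0, 2.0]   # hypothetical treatment effects, summing to zero
lam = noncentrality(effects, error_variance=4.0, k=4)

# Doubling every effect quadruples the parameter; doubling the error variance halves it.
lam_big_effects = noncentrality([2 * t for t in effects], error_variance=4.0, k=4)
lam_noisier = noncentrality(effects, error_variance=8.0, k=4)
print(lam, lam_big_effects, lam_noisier)
```

An experimental technique can therefore gain sensitivity either by magnifying treatment differences or by reducing experimental error, and the parameter does not distinguish between the two.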
We required the distribution of

$$w = F_1/F_2 \tag{3}$$

and its density function is

$$g(w; a, b, \lambda) = e^{-2\lambda} \sum_{r=0}^{\infty} \sum_{s=0}^{\infty} [B(a + r, b)\, B(a + s, b)]^{-1}\, \frac{\lambda^{r+s}}{r!\, s!}\, w^{a+r-1} H(w; r, s), \quad 0 \le w < \infty, \tag{4}$$

when

$$\lambda_1 = \lambda_2 = \lambda \tag{5}$$

and where

$$H(w; r, s) = \int_0^\infty y^{2a+r+s-1} (1 + wy)^{-(a+b+r)} (1 + y)^{-(a+b+s)}\, dy$$

$$= B(2a + r + s, 2b)\, {}_2F_1[a + b + r,\ 2a + r + s,\ 2a + 2b + r + s,\ 1 - w], \quad 0 \le w \le 1, \tag{6}$$

$$= w^{-(2a+r+s)} B(2a + r + s, 2b)\, {}_2F_1[a + b + s,\ 2a + r + s,\ 2a + 2b + r + s,\ (w - 1)/w], \quad 1 \le w < \infty.$$

${}_2F_1$ in (6) is the hypergeometric function. It has been noted that
$g(w; a, b, \lambda)$ has a mode at a value of w between 0 and 1, a median at
w = 1 and that lim g(w) = 0 as w → 0 and as w → ∞. Further, we
use the fact that


$$g(w; a, b, \lambda) = g(1/w; a, b, \lambda). \tag{7}$$
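We have shown that the distribution function

$$G(w_0; a, b, \lambda) = P(w \le w_0 \mid a, b, \lambda) = \int_0^{w_0} g(w; a, b, \lambda)\, dw$$

may be approximated by the distribution function,

$$G(w_0; a', b, 0) = [B(a', b)]^{-2} \int_0^\infty y^{a'-1} (1 + y)^{-(a'+b)} (w_0 y)^{a'} (1 + w_0 y)^{-a'} \left[ \frac{1}{a'} - \frac{(b - 1)\, w_0 y}{(a' + 1)(1 + w_0 y)} + \frac{(b - 1)(b - 2)(w_0 y)^2}{2(a' + 2)(1 + w_0 y)^2} - \cdots \right] dy, \tag{8}$$

where

$$a' = (a + \lambda)^2/(a + 2\lambda). \tag{9}$$

The suitability of this approximation was determined on the basis of
comparing moments of $g(w; a, b, \lambda)$ and of $g(w; a', b, 0)$ and a limited
number of values of $G(w_0; a, b, \lambda)$ and $G(w_0; a', b, 0)$.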
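
The approximation can be explored by simulation. The Python sketch below is not the authors' method of verification (they compared moments and a limited number of exact values); it is a Monte Carlo illustration under stated assumptions: each noncentral variance ratio is drawn as a Poisson mixture of gamma variates, the values a = 4, b = 12, λ = 5 are hypothetical, and all function names are introduced here.

```python
import random
from math import exp

def a_prime(a, lam):
    """Formula (9): adjusted numerator parameter for the central approximation."""
    return (a + lam) ** 2 / (a + 2 * lam)

def poisson_draw(rng, lam):
    """Knuth's multiplication method; adequate for the small lam used here."""
    limit, k, p = exp(-lam), 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k

def variance_ratio(rng, a, b, lam):
    """One draw of F with 2a and 2b d.f. and noncentrality lam, as in (1)."""
    j = poisson_draw(rng, lam) if lam > 0 else 0
    return (rng.gammavariate(a + j, 1.0) / a) / (rng.gammavariate(b, 1.0) / b)

def simulate_G(w0, a, b, lam, draws=10000, seed=1):
    """Monte Carlo estimate of P(w <= w0) for w = F1/F2 with equal parameters."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(draws):
        w = variance_ratio(rng, a, b, lam) / variance_ratio(rng, a, b, lam)
        hits += w <= w0
    return hits / draws

a, b, lam = 4, 12, 5.0   # hypothetical values
for w0 in (1.0, 2.0, 3.0):
    print(w0, simulate_G(w0, a, b, lam), simulate_G(w0, a_prime(a, lam), b, 0.0))
```

The two printed columns estimate $G(w_0; a, b, \lambda)$ and $G(w_0; a', b, 0)$ respectively; both should be near one half at $w_0 = 1$, since w has its median there.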

A table of values of $w_0(.05)$ for which $G[w_0(.05); a, b, 0] = .95$ was
prepared by Schumann and Bradley [1957, Table I] for integer and
half-integer values of a, a = 1, ..., 21/2, 15 and for integer values
of b, b = 1, ..., 10, 15, ∞. This table is reproduced here as Table 1
for the convenience of the reader. The table is essentially triangular,
for $G(w_0; a, b, 0) = G(w_0; b, a, 0)$. It is apparent from the table that
values of $G(w_0; a, b, \lambda)$ and $G(w_0; a', b, 0)$ are fairly stable even for
considerable variation in values of $\lambda$ and for quite small values of a
and b. This implies that it will have little effect in applications if we
enter the table using a value of $\lambda$ somewhat different from its true
(and usually unknown) value. Linear interpolation in Table 1 will
be satisfactory in applications.

In usual multiple regression theory with non-stochastic independent 
variables, if we fit two regression equations of the form 


$$y = \beta_0 + \beta_1 x_1 + \cdots + \beta_p x_p + \epsilon \tag{10}$$


to two independent sets of data and obtain corresponding multiple
correlation coefficients, $R_1$ and $R_2$, based on N observation vectors
each,

$$F_i = bR_i^2/a(1 - R_i^2), \quad i = 1, 2, \tag{11}$$

have distributions of the form (1) with

$$\lambda_i = (a + b)\rho_i^2/(1 - \rho_i^2), \quad a = p/2, \quad b = (N - p - 1)/2, \tag{12}$$

where $\rho_i$, i = 1, 2, is the population multiple correlation coefficient
corresponding to $R_i$. It then follows that the procedures developed
apply here and that we may perform tests of hypotheses on multiple
correlation coefficients dependent on those procedures. It is common
practice to use $R^2$ as a measure of the fraction of the variability in the
dependent variable explained by regression. It will be useful in many
cases to be able to test hypotheses on the equality of $\rho_1^2$ and $\rho_2^2$.

Note that the application to comparing multiple correlation
coefficients is suitable for the regression model (10) but does not apply
to correlations based on models assuming a multivariate normal
distribution for vectors $(y, x_1, \ldots, x_p)$, for then the distribution (1) is not
correct.
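
The quantities in (11) and (12) are easily computed from the squared multiple correlation coefficients. The Python sketch below is illustrative; the numbers of regressors and observations and the values of $R^2$ are hypothetical, and the function names are introduced here. It also shows that the ratio of the two F-statistics depends only on the two values of $R^2$, because the factor b/a is common to both experiments.

```python
def degrees(p, n_obs):
    """Equation (12): a = p/2 and b = (N - p - 1)/2."""
    return p / 2.0, (n_obs - p - 1) / 2.0

def f_from_r2(r2, p, n_obs):
    """Equation (11): F = b R^2 / (a (1 - R^2))."""
    a, b = degrees(p, n_obs)
    return b * r2 / (a * (1.0 - r2))

def lambda_from_rho2(rho2, p, n_obs):
    """Equation (12): lambda = (a + b) rho^2 / (1 - rho^2)."""
    a, b = degrees(p, n_obs)
    return (a + b) * rho2 / (1.0 - rho2)

p, n_obs = 3, 24                   # hypothetical: 3 regressors, 24 observation vectors
r2_first, r2_second = 0.8, 0.5     # hypothetical squared multiple correlations
F1 = f_from_r2(r2_first, p, n_obs)
F2 = f_from_r2(r2_second, p, n_obs)
w = r2_first * (1.0 - r2_second) / (r2_second * (1.0 - r2_first))
print(F1, F2, F1 / F2, w)
```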


TEST PROCEDURES 


(i) Tests of hypotheses on sensitivity

In tests of sensitivity we shall use the statistic $w = F_1/F_2$ defined
in (3) and on the assumptions stated earlier that $F_1$ and $F_2$ are
independent, have 2a and 2b degrees of freedom, and have noncentral
F-distributions with parameters $\lambda_1$ and $\lambda_2$. $w_0(\alpha)$ is defined to be
such that $G[w_0(\alpha); a, b, \lambda] = 1 - \alpha$. Equality of $\lambda_1$ and $\lambda_2$ implies
equal sensitivities for the two experiments so long as k in (2) is the
same for each experiment, that is, so long as treatment means are
based on equal numbers of replications.


Test 1. $H_0: \lambda_1 = \lambda_2 = \lambda$; $H_{a1}: \lambda_1 > \lambda_2$. (13)


The null hypothesis $H_0$ represents the hypothesis of equal sensitivities
for the two identical experiments. The alternative hypothesis $H_{a1}$
assumes that we are interested in the one-sided test perhaps because
we shall not change to the method of Experiment 1 from the currently
used method of Experiment 2 unless the former is more sensitive.


Given $\alpha$, we obtain $w_0(\alpha)$ and reject $H_0$ in favor of $H_{a1}$ at significance
level $\alpha$ when $w > w_0(\alpha)$.


Test 2. $H_0: \lambda_1 = \lambda_2 = \lambda$; $H_{a2}: \lambda_1 < \lambda_2$. (14)


This test is identical with Test 1 when the identities of the experiments
are interchanged.

Test 3. $H_0: \lambda_1 = \lambda_2 = \lambda$; $H_{a3}: \lambda_1 \ne \lambda_2$. (15)


The two-sided test is of course similar to Tests 1 and 2 and would
be used when departures from $H_0$ in either direction are of interest.
For significance level $2\alpha$, reject $H_0$ in favor of $H_{a3}$ when $w > w_0(\alpha)$
or when $1/w > w_0(\alpha)$.
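
The three rejection rules differ only in which of w and 1/w is compared with the tabled value. A minimal Python sketch follows; the function name, the labels for the alternatives, and the numerical values of w and $w_0(\alpha)$ are hypothetical, and the critical value must in practice be read from Table 1.

```python
def reject_equal_sensitivity(w, w0, alternative):
    """Decision rules of Tests 1-3; w0 is the tabled critical value w0(alpha)."""
    if alternative == "first_more_sensitive":    # Test 1, level alpha
        return w > w0
    if alternative == "second_more_sensitive":   # Test 2: experiments interchanged
        return 1.0 / w > w0
    if alternative == "two_sided":               # Test 3, level 2 * alpha
        return w > w0 or 1.0 / w > w0
    raise ValueError("unknown alternative: " + alternative)

w, w0 = 3.0, 2.5   # hypothetical statistic and critical value
for alt in ("first_more_sensitive", "second_more_sensitive", "two_sided"):
    print(alt, reject_equal_sensitivity(w, w0, alt))
```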

(ii) Tests of hypotheses on correlation


Test 4. $H_0: \rho_1 = \rho_2 = \rho$; $H_{a4}: \rho_1 > \rho_2$. (16)
Test 5. $H_0: \rho_1 = \rho_2 = \rho$; $H_{a5}: \rho_1 < \rho_2$. (17)
Test 6. $H_0: \rho_1 = \rho_2 = \rho$; $H_{a6}: \rho_1 \ne \rho_2$. (18)


Tests 4, 5, and 6 may be associated with Tests 1, 2, and 3 respectively
when we redefine


$$w = R_1^2(1 - R_2^2)/R_2^2(1 - R_1^2) \tag{19}$$


with the association of the parameters in (12). With this new definition
of w, the test procedures are exactly those used in (i) above for tests
of hypotheses on sensitivity. It is of course necessary that (a + b)
in (12) be constant for the two regression experiments.

(iii) Analysis

The following easy basic steps are required in the tests outlined 
above. 

1. Do analyses of variance for both experiments yielding F-statistics
valid for treatment comparisons, each with 2a and 2b degrees of freedom.
[When analysis of variance for multiple regression is used, $F = bR^2/a(1 - R^2)$.]

2. Estimate the parameters $\lambda_1$ and $\lambda_2$ pertaining to the two
experiments.

(a) In Model I of the analysis of variance, estimate $\lambda$ for each
experiment by taking

$$\hat{\lambda} = a\left[\frac{s_t^2}{s_e^2} - 1\right] \tag{20}$$

where $s_t^2$ and $s_e^2$ are respectively mean squares for treatments and for
error. The mean of $\hat{\lambda}_1$ and $\hat{\lambda}_2$ is heuristically taken to be the “best
available” single estimate* of the population parameter $\lambda$.

(b) In multiple regression, Snedecor [1950, p. 348] gives

$$R_A^2 = 1 - (1 - R^2)(a + b)/b \tag{21}$$

in our notation and $R_A^2$ may be taken as an estimate of $\rho^2$ required
to estimate $\lambda$ in (12). This is however equivalent to taking

$$\hat{\lambda} = a\left[\frac{s_{reg}^2}{s_{res}^2} - 1\right] \tag{22}$$

similar to (20) where $s_{reg}^2$ and $s_{res}^2$ are respectively mean squares for
regression and residual (error) variation in the analysis of variance
for regression. Then again estimates of $\lambda_1$ and $\lambda_2$ are obtained from
the two experiments and their mean is taken as $\hat{\lambda}$.

3. Formulate the null hypothesis $H_0: \lambda_1 = \lambda_2 = \lambda$, using $\hat{\lambda}$ for $\lambda$,
and the suitable alternative hypothesis selected from Tests 1-6.

4. Decide on the significance level to be used, $\alpha$ for a one-sided
test and $2\alpha$ for a two-sided test.*

5. Calculate $w = F_1/F_2$.

6. Compute $a' = (a + \hat{\lambda})^2/(a + 2\hat{\lambda})$ using $\hat{\lambda}$ obtained in Step 2.

7. Obtain $w_0(\alpha)$ from Table 1, possible when $\alpha = .05$, by entering
that table with $a'$ for a and with b and by using linear interpolation
in the table.

8. Compare w with $w_0(\alpha)$ rejecting $H_0$ in favor of $H_a$ in accordance
with the rules set forth for Tests 1-6.


*The conservative statistician might take $\hat{\lambda}$ to be the smaller of $\hat{\lambda}_1$ and $\hat{\lambda}_2$; the ultra-conservative
statistician might take $\hat{\lambda} = 0$. For moderately large a and b, $w_0(\alpha)$ does not change much with changes
in $\hat{\lambda}$.
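
The arithmetic of Steps 2(a), 5 and 6 can be collected into a short routine. The Python sketch below is an illustration under stated assumptions: the function names are introduced here, equation (20) is written as a(F - 1) because F is the ratio of the treatment and error mean squares, and the inputs are the F-statistics 19.14 and 5.42 with a = 4 from the example on the two judges. Step 7 still requires the tabled value $w_0(\alpha)$, which is not computed here.

```python
def lambda_hat(F, a):
    """Equation (20) written in terms of F = (treatment MS)/(error MS): a(F - 1)."""
    return a * (F - 1.0)

def a_prime(a, lam):
    """Equation (9): value with which Table 1 is entered in place of a."""
    return (a + lam) ** 2 / (a + 2.0 * lam)

def prepare_sensitivity_test(F1, F2, a):
    """Steps 2(a), 5 and 6: estimates of lambda, the statistic w, and a'."""
    lam1, lam2 = lambda_hat(F1, a), lambda_hat(F2, a)
    lam = 0.5 * (lam1 + lam2)          # heuristic single estimate of lambda
    return {"lambda_1": lam1, "lambda_2": lam2, "lambda": lam,
            "w": F1 / F2, "a_prime": a_prime(a, lam)}

result = prepare_sensitivity_test(19.14, 5.42, a=4)
for name, value in result.items():
    print(name, round(value, 2))
```

Taking the smaller of the two estimates, or zero, in place of the mean gives the conservative variants mentioned in the footnote; only the value of $a'$ used to enter the table changes.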


NUMERICAL EXAMPLES 


The following numerical examples have been chosen to illustrate 
applications of the method of comparing sensitivities of identical 
experiments in different experimental situations. In summarizing the 
analyses we follow the steps outlined in the preceding section. 

Example 1. Discriminating Abilities of Judges. 

Two judges were asked to score desirability of color of nine brands
of canned tomatoes in two independent randomized block experiments.
The discrete scoring scale employed had values ranging in units of 5
from 0 to 100. Although the scale was discrete, it will be assumed that
analysis of variance applies and the observed scores were used in such
analyses. Judge A was to be used on a scoring panel if he demonstrated
sensitivity to color differences as well as the experienced Judge B.

The nine brands of canned tomatoes with the average scores accorded
each, averaged over four replications, for each judge are given in
Table 2.


TABLE 2
AVERAGE COLOR SCORES FOR CANNED TOMATOES

Brand               2             5      9             3
Judge A (Exp. 1)    [illegible]   85.0   [illegible]   75.0
Judge B (Exp. 2)    15.0          68.7   65.0          60.0

Brand               1             7      4      8      6
Judge A             [illegible]   50.0   48.7   48.7   [illegible]
Judge B             52.5          45.0   40.0   40.0   38.7


*At the present time we are limited to taking $\alpha = .05$ due to the unavailability of other tables.

The analysis for comparison of the sensitivities (discriminating 
abilities) of the two judges now follows: 

1. Analyses of variance are given in Table 3. $F_1 = 19.14$ and
$F_2 = 5.42$ while 2a = 8, 2b = 24, a = 4, b = 12.


TABLE 3
ANALYSES OF VARIANCE FOR THE TWO JUDGES

Source            d.f.    Mean Square     F

Judge A (Experiment 1)
Blocks              3       167.6
Treatments          8       [illegible]   19.14**
Error              24        59.3

Judge B (Experiment 2)
Blocks              3       346.3
Treatments          8       761.6          5.42**
Error              24       140.6

**Significant at the .01 level of significance.


2. We use method (a), equation (20).

$$\hat{\lambda}_1 = 4\left[\frac{\text{treatment mean square}}{59.3} - 1\right] = 72.5,$$

$$\hat{\lambda}_2 = 4\left[\frac{761.6}{140.6} - 1\right] = 17.7.$$

We take $\hat{\lambda} = \tfrac{1}{2}(17.7 + 72.5) = 45.1$.

3. The hypotheses are $H_0 : \lambda_1 = \lambda_2 = \lambda$, $H_a : \lambda_1 > \lambda_2$ as described
in Test 1.

4. It is specified that $\alpha = .05$.

5. w = 19.14/5.42 = 3.53.

6. $a' = (4 + 45.1)^2/(4 + 90.2) = 25.6$ from (9).

7. $w_0(.05) < 2.87$, obvious from Table 1 with $a' = 25.6$, b = 12,
although $a'$ exceeds the tabled values of a in Table 1.

8. $H_0$ is rejected in favor of $H_a$ on the basis of the test for
$w > w_0(.05)$. This implies that Judge A is a good judge of color differences
even in comparison with experienced Judge B and Judge A may
be included on the scoring panel.

Note that if we had taken $\hat{\lambda} = \hat{\lambda}_2 = 17.7$, we would have obtained
$a' = (4 + 17.7)^2/(4 + 35.4) = 12.0$ and $w_0(.05) \approx 2.60$. Our conclusions
would of course be unchanged.

Examination of Table 2 suggests that the two judges are not using 
the scoring scale in quite the same way but that the difference is largely 
due to location and scale-unit differences. There is no evidence of 
judge by treatment interaction of the sort that would make interpre- 
tation of the w-test used here difficult. In most cases it would be neces- 
sary to train Judge A in use of the scoring scale consistent with that 
of the other panel members before he may be included on the panel. 

Example 2. Sensitivity of Pasture Yields to Irrigation. 

The yields of a mixture of Orchard Grass and Ladino Clover sub- 
jected to eight different treatments (eight fertilizers) were obtained 
from plots which were either irrigated or not irrigated. The experiments 
relating to irrigation and no irrigation were independent and each 
contained four replications. The average yields in pounds per acre 
for treatments A to H, averaged over the four replications, are given 
in Table 4. An objective was to learn whether the effects of differences 
in fertilization would be more or less pronounced under irrigation 
than under no irrigation. 

Many irrigation experiments of the kind described have been 
conducted at the Virginia Agricultural Experiment Station and little 
treatment by irrigation interaction has been obtained that is of a 
fundamental nature (that is, that cannot be explained by a shift in 
treatment locations and a magnification of treatment differences due 
to irrigation). If, for example, we plot pairs of observations in columns 
of Table 4, we find the points so obtained very nearly collinear and this 
is a crude but adequate check on the explanation of the treatment by 
irrigation interaction. 

Sometimes split-plot designs are used in this sort of irrigation 
experiment, but this is not usually acceptable since error variances for 
irrigated plots are often eight to ten times as large (only 2.7 times as 
large in our example) as for plots without irrigation. This same hetero- 
geneity of error variances also largely precludes the possibility of 


TABLE 4
MEAN YIELDS OF ORCHARD GRASS AND LADINO CLOVER IN LBS/ACRE UNDER
FERTILIZER TREATMENTS

Treatment                  A      B      C      D      E      F      G      H

No irrigation (Exp. 1)    499    627    642    756    884    494    597    377
Irrigation (Exp. 2)       831    836    931   1137   1346    730    903    583

looking at tables of means such as Table 4 in order to compare the
sensitivities of the two growing conditions to treatment differences. 
The split-plot design is not necessary (whole plot treatments were 
“irrigation” and “no irrigation”) for in most experiments in most
years huge differences between irrigation and no irrigation are found. 

The analysis for comparison of the sensitivities of no-irrigation and 
irrigation experiments to differences in fertilizer treatments is now set 
out. 

1. $F_1 = 8.86$, $F_2 = 7.10$, a = 3.5, and b = 10.5 from analyses of
variance in Table 5.

2. $\hat{\lambda}_1 = 3.5(8.86 - 1) = 27.5$ and $\hat{\lambda}_2 = 3.5(7.10 - 1) = 21.4$ and we
take $\hat{\lambda} = \tfrac{1}{2}(27.5 + 21.4) = 24.4$ using (20) and Step 2.

3. $H_0 : \lambda_1 = \lambda_2 = \lambda$ and $H_a : \lambda_1 \neq \lambda_2$ were chosen in accordance
with the objective of the experiments.

4. We take $\alpha = .05$ and the significance level is .10.

5. w = 8.86/7.10 = 1.25.

6. $a' = (3.5 + 24.4)^2/(3.5 + 48.8) = 14.9$ from (9).

7. We use the result that $G(w_0 ; a, b, 0) = G(w_0 ; b, a, 0)$ and enter
Table 1 with a = 10.5 (in our experiments b = 10.5) and with b = 14.9
(we obtained $a' = 14.9$). Then we find that $2.36 < w_0(.05) < 2.63$.


TABLE 5
ANALYSES OF VARIANCE FOR THE TWO FERTILIZER EXPERIMENTS

Source            d.f.    Mean Square     F

Experiment 1: No Irrigation
Replications        3        17,899
Treatments          7       102,104       8.86**
Error              21       [illegible]

Experiment 2: Irrigation
Replications        3        93,155
Treatments          7       224,462       7.10**
Error              21        31,599

**Significant at the .01 level of significance.


8. w is not significant in comparison with $w_0(.05)$ nor is 1/w. It
does not appear that these irrigation and no-irrigation experiments
differ much in sensitivity to fertilizer treatment effects. (While we
took $\hat{\lambda} = 24.4$, the same conclusion would have been reached for other
values of $\hat{\lambda}$ in a fairly wide interval about 24.4.)

Example 3. A Comparison of Two Multiple Correlation Coefficients. 

In research in chemical engineering on an extraction mechanism,
the rate of transfer of one component of a two-component, two-phase
system (water and a solvent) across the interface was measured.*
This rate of transfer was correlated with the physical properties of the
phases. For any one system, $\log(k_s d/D_w)$ or y was assumed to be
linearly related to $\log(G_s d/\mu_s)$ or $x_1$ and $\log(G_w d/\mu_w)$ or $x_2$ as a result
of a multiplicative model based on theoretical knowledge of the
mechanism. The symbols used are:


$k_s$: The film coefficient for transfer of water into the solvent
phase, lb. moles per hr., sq. ft., unit concentration difference.

d: The diameter of the extraction tube, ft.

$D_w$: The diffusivity of water into solvent, sq. ft. per hr.

$G_s$: The mass velocity of the solvent phase, lb. per hr., sq. ft.

$G_w$: The mass velocity of the water phase, lb. per hr., sq. ft.

$\mu_s$: The viscosity of the solvent phase, lb. per ft., hr.

$\mu_w$: The viscosity of the water phase, lb. per ft., hr.


For this example, we consider regression equations of the form

$$y = a + b_1 x_1 + b_2 x_2$$

for two solvents, cyclohexanol and methyl ethyl ketone. The independent
variables, $x_1$ and $x_2$, are measurable with small errors that are
insignificant in comparison with the residual variability of y. Accordingly,
the regression equations and the resulting estimates of the
multiple correlation coefficients are appropriate to illustrate the
comparison described in this paper. We are interested in comparing the
dependencies of y, essentially a transfer rate, on the physical properties
of the phases for the two different solvents. The experimental results
and the statistical analysis are now set forth.

The regression equations are:

1. Cyclohexanol,

$$y = 1.0986 + 1.0931 x_1 + 0.1736 x_2$$

and

2. Methyl ethyl ketone,

$$y = -2.8715 + 0.8712 x_1 + 0.5234 x_2 .$$

The corresponding squared multiple correlation coefficients are
$R_1^2 = 0.571$ and $R_2^2 = 0.486$ with 2 and 34 and 2 and 31 degrees of
freedom respectively.


The two regression equations look rather different but the objective
is simply to compare values of $\rho^2$ as measures of the effectiveness of
the regressions. This can be done here since the degrees of freedom are
almost the same for the two studies and this is required in view of the
definition of $\lambda$ in (12).

*We are indebted to Professor N. F. Murphy of the Department of Chemical Engineering, Virginia
Polytechnic Institute, for permission to use his data and for advice on the presentation of this example.
The regression equations in this example were calculated from work on liquid-liquid extraction given
by Murphy, Lastovieca, and Skrzee [1956] which contains similar, but somewhat more complex,
regression analyses.

The analysis is:

1. $R_1^2/(1 - R_1^2) = 1.3310$, $R_2^2/(1 - R_2^2) = 0.9455$ with a = 1 and
$b_1 = 17$, $b_2 = 15.5$. We take $\bar{b} = (17 + 15.5)/2 = 16.25$, the average
of the values of b for $R_1^2$ and $R_2^2$.

2. Using Step 2(b), formula (21), we have

$$\hat{\rho}_1^2 = 1 - (1 - 0.571)(1 + 17)/17 = 0.546,$$

$$\hat{\rho}_2^2 = 1 - (1 - 0.486)(1 + 15.5)/15.5 = 0.453.$$

Then, from (12),

$$\hat{\lambda}_1 = (1 + 17)(0.546)/(1 - 0.546) = 21.647$$

and

$$\hat{\lambda}_2 = (1 + 15.5)(0.453)/(1 - 0.453) = 13.665.$$

We take

$$\hat{\lambda} = \tfrac{1}{2}(21.647 + 13.665) = 17.66.$$

3. The hypotheses are $H_0 : \rho_1^2 = \rho_2^2$ and $H_a : \rho_1^2 \neq \rho_2^2$.

4. $\alpha = 0.05$; the significance level is $2\alpha = 0.10$.

5. w = 1.3310/0.9455 = 1.408 from (19).

6. $a' = (1 + 17.66)^2/[1 + 2(17.66)] = 9.59$.

7. It is evident from Table 1 that $w_0(.05) \approx 2.67$ and certainly
$w_0(.05) > 2.17$ when we enter that table with a = 9.59 and $b = \bar{b} = 16.25$.

8. The observed value of w, 1.408, is exceeded by $w_0(.05) > 2.17$
and we do not reject $H_0 : \rho_1^2 = \rho_2^2$ formulated in Step 3 above.
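
The arithmetic of Steps 1, 2, 5 and 6 for this example can be retraced with a short Python sketch. The function names are ours and purely illustrative; formulas (21) and (12) are used in the form in which they are applied in Step 2 above, with a = 1.

```python
def corrected_rho2(r2, a, b):
    """Formula (21) as applied in Step 2: R^2 adjusted for degrees of freedom."""
    return 1.0 - (1.0 - r2) * (a + b) / b


def noncentrality(rho2, a, b):
    """Formula (12) as applied in Step 2."""
    return (a + b) * rho2 / (1.0 - rho2)


a = 1.0                        # half the regression degrees of freedom (2a = 2)
r2_1, b_1 = 0.571, 17.0        # cyclohexanol
r2_2, b_2 = 0.486, 15.5        # methyl ethyl ketone

lam_1 = noncentrality(corrected_rho2(r2_1, a, b_1), a, b_1)
lam_2 = noncentrality(corrected_rho2(r2_2, a, b_2), a, b_2)
lam = 0.5 * (lam_1 + lam_2)

w = (r2_1 / (1.0 - r2_1)) / (r2_2 / (1.0 - r2_2))    # Step 5
a_prime = (a + lam) ** 2 / (a + 2.0 * lam)           # Step 6

print(f"lambda_1={lam_1:.2f} lambda_2={lam_2:.2f} lambda={lam:.2f}")
print(f"w={w:.3f} a'={a_prime:.2f}")
```

Because the sketch does not round intermediate values, it gives about 21.63, 13.66 and 17.64 for the three estimates and about 9.58 for a′, slightly below the printed figures, which use the rounded values 0.546 and 0.453. The conclusion is unaffected.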


DISCUSSION AND SUMMARY 


We have illustrated a method of comparing the sensitivities of two 
identically designed experiments and of comparing the squares of two 
multiple correlation coefficients from comparable regression analyses. 
In each case, the problem reduced to a test of the hypothesis of equality 
of two parameters of noncentrality in two noncentral F-distributions.
This is satisfactory when the number of replications in each treatment 
mean is the same in each analysis of variance and when each regression 
equation is based on the same number of observations. 

Examples where these methods may be used have been pointed out 
in a general way in a number of areas of research. Three particular 
numerical examples are given illustrating the application of the steps
for analysis specifically set forth. Some discussion of the care needed
in interpreting these analyses is given and in particular it is pointed 
out that treatment by experiment interactions of a fundamental nature 
may lead to difficulties of interpretation. The table available for the 
tests of significance required in analyses is somewhat limited (this is 
indicated from the way the examples are handled) but Schumann is 
planning to extend these tables and to prepare tables for values of $\alpha$,
the significance level in one-sided tests, of .025, .01, and .005. Nevertheless,
the available table with $\alpha = .05$ does provide a means of making
the tests that are often desired in experimental work. 

There are some situations wherein the sensitivities of experiments 
that do not meet the requirements on similarity set forth should be 
compared. The practical situations seem to be where the same treat- 
ments are in both experiments but where the experiments are based 
on different numbers of replications or on somewhat different experi- 
mental designs. A crude method of obtaining an approximation to 
the critical value of w in that situation based on the use of Table 1 has 
been suggested by Schumann and Bradley [1957] but its accuracy is 
not known and it is not repeated here. 

In conclusion, D. E. W. Schumann wishes to express his appreciation 
of a grant from the South African Council of Industrial and Scientific 
Research, Pretoria, South Africa, and for assistance from Fulbright 
and U.S. Department of State grants. These grants and the assistance 
of the Agricultural Research Service, U.S.D.A., made this work possible. 

We are grateful to Mrs. Sally Arnold, who did much of the computing,
and to Mrs. H. P. Ellison, Miss Marianne Byrd, and Mrs. J. D. Hall for
clerical assistance.


ERROR RATES AND SAMPLE SIZES FOR RANGE TESTS IN
MULTIPLE COMPARISONS


H. Leon Harter 


Wright Air Development Center 
Wright-Patterson Air Force Base, Ohio, U. S. A. 


1. Introduction 


The problem of multiple comparisons has recently aroused a great 
deal of interest among statisticians. The basic F-test in an analysis 
of variance determines whether there is a significant difference among 
a group of means, but it cannot tell which means differ significantly 
from which others. The latter is often what the investigator really 
wants to know. Various multiple comparisons tests, including the 
range tests discussed in this paper, have been proposed. A study is
made here of the error rates, $\alpha$ and $\beta$, and their relation to sample size,
N, for three fixed range tests and three multiple range tests.

Let it be required to test the significance of the range of p out of
m ordered means of samples of size N, where p = 2, 3, ..., m. For the
fixed range tests, the critical range for a particular Type I error rate,
$\alpha$, depends only on m and N; for the multiple range tests, it depends
also on p. In fact, for fixed m and N, it is a non-decreasing function of
p. The range of p means is said to be significant or non-significant
(at the $\alpha$ level) according as it does or does not exceed the critical range,
except that the range of p means is automatically non-significant if
these p means constitute a subgroup of a larger group whose range is
non-significant. Because of this exception, a multiple range test starts
with a test on all m means. If the range of all m means is found to be
significant, then tests are performed on the ranges of (m − 1) successive
means, (m − 2) successive means, and so on until significant differences
are no longer found.


2. Significance vs. Confidence 


Though the emphasis here is upon tests of significance of size $\alpha$
for the difference between sample means, each of the above methods
can also be used for setting $100(1 - \alpha)\%$ confidence limits on the
difference between population means. The choice between significance
tests and confidence limits depends upon the experimental situation 
and upon the experimenter’s philosophy. Many statisticians, in- 
cluding Tukey [1953], prefer confidence procedures in practically all 
cases, and the author is convinced that there is much merit in this 
position. The results of this paper, however, will be stated in terms 
of significance. The reader who prefers confidence procedures can 
easily translate from the language of significance to that of confidence. 


3. The Fixed Range Tests 


The three fixed range tests which will be considered are the least
significant difference (LSD) test based upon the work of “Student”
[1908], a modification of this proposed by Fisher [1935], and the
studentized range test proposed by Tukey [1952]. They will be designated
by code letters L, E and W respectively.

The differences LSD, WSD and ESD between any two out of m
means of samples of size N required for significance (at the $\alpha$ level)
by the LSD test, Tukey’s studentized range test, and Fisher’s test,
respectively, are given by


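$$\mathrm{LSD} = t(\alpha, \nu)\, s_d = q(\alpha, 2, \nu)\, s_{\bar{x}} , \qquad (1)$$
$$\mathrm{WSD} = q(\alpha, m, \nu)\, s_{\bar{x}} , \qquad (2)$$
$$\mathrm{ESD} = t(\alpha/C_2^m , \nu)\, s_d = q(\alpha/C_2^m , 2, \nu)\, s_{\bar{x}} , \qquad (3)$$

where $\nu$ is the number of degrees of freedom for the error mean square,
$s^2$; $t(\alpha, \nu)$ is the 2-tailed $\alpha$ point (upper $\alpha/2$ point) of Student’s t with
$\nu$ degrees of freedom; $q(\alpha, r, \nu)$ is the upper $\alpha$ point of the studentized
range of r observations with $\nu$ degrees of freedom for s; $s_{\bar{x}}$ is the standard
error of the mean, $s_{\bar{x}} = \sqrt{s^2/N}$; and $s_d$ is the standard error of the
difference between means, $s_d = \sqrt{2s^2/N} = \sqrt{2}\, s_{\bar{x}}$.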
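
The two standard errors and their use in equation (1) can be illustrated with a minimal Python sketch. The function name is ours; the inputs $s^2 = 19.75$ and N = 20 are those of the example in Section 5, and the value 1.985 for the two-tailed 5% point of Student's t with 95 degrees of freedom is an approximate tabled value assumed here, not a figure quoted in the paper.

```python
import math


def standard_errors(s2, n):
    """Standard error of a mean and of a difference between two means."""
    s_mean = math.sqrt(s2 / n)            # s_xbar = sqrt(s^2 / N)
    s_diff = math.sqrt(2.0 * s2 / n)      # s_d = sqrt(2 s^2 / N) = sqrt(2) * s_xbar
    return s_mean, s_diff


s_mean, s_diff = standard_errors(19.75, 20)

t_crit = 1.985                            # assumed approximate t(0.05, 95)
lsd = t_crit * s_diff                     # equation (1), first form
lsd_via_q = (math.sqrt(2.0) * t_crit) * s_mean   # equation (1), second form: q = sqrt(2) t

print(round(s_mean, 3), round(s_diff, 3), round(lsd, 2))
```

Both forms of equation (1) give the same LSD because $q(\alpha, 2, \nu) = \sqrt{2}\, t(\alpha, \nu)$, which is also the relation used later in equation (8).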

Each of these three test procedures has Type I error rate $\alpha$ for an
appropriate definition of the error rate. As a result of a test on the
means of m samples each of size N, a statement of significance or
non-significance will be made about each of the $C_2^m$ differences between pairs
of means. A wrong statement (Type I error) is a statement that two
means are significantly different, when actually they are means of
random samples from the same population. The appropriate error
rates $\alpha_L$, $\alpha_W$, and $\alpha_E$ for the respective test types are then defined as
follows: $\alpha_L$ is the expected proportion of wrong statements (error
rate per comparison); $\alpha_W$ is the expected proportion of experiments
with one or more wrong statements (familywise error rate); and $\alpha_E$
is the expected number of wrong statements per experiment (error
rate per family).

In practice, many statisticians (including the author) use the 
protected LSD test, in which the individual comparisons are made 
only if an analysis of variance yields a significant F-ratio for the
comparison of all m means. The Type I error rate, $\alpha_L$, which will be
considered here, however, is that for the unprotected LSD test, otherwise
known as the multiple t-test. 


4. The Multiple Range Tests 


The three multiple range tests which will be considered are the 
Newman-Keuls test, Tukey’s X procedure, and Duncan’s new multiple 
range test. They will be designated by the code letters I, X and S,
respectively. 

The Newman-Keuls test was first proposed by Newman [1939] 
and modified slightly by Keuls [1952]. The critical range of p out of 
m ordered means of samples each of size N is based on the studentized 
range of p observations instead of on the studentized range of m obser- 
vations as in Tukey’s studentized range test. It can easily be seen 
that the critical range ISD for the Newman-Keuls test is intermediate 
between the LSD and the WSD, except that it is the same as the LSD 
for p = 2 and the same as the WSD for p = m. 

Tukey’s X procedure was proposed by Tukey [1953] as a com- 
promise between the Newman-Keuls test and Tukey’s studentized 
range test. The critical range is the arithmetic mean of the critical 
ranges for those two tests. For fixed m and N, the smallest critical 
range (a value midway between the LSD and the WSD) occurs for 
p = 2, and the largest critical range (equal to the WSD) occurs for 
p = m.

Duncan’s new multiple range test, proposed by Duncan [1953,
1955], makes use of special protection levels based upon degrees of
freedom. Let $\gamma_{2,\alpha} = 1 - \alpha$ be the protection level for testing the
significance of difference between two means, that is, the probability
that a significant difference between sample means will not be found
if the population means are equal. Duncan reasons that one has (p − 1)
degrees of freedom for testing p means, and hence one may make
(p − 1) independent tests, each with protection level $\gamma_{2,\alpha}$. Hence
the joint protection level is

$$\gamma_{p,\alpha} = \gamma_{2,\alpha}^{\,p-1} = (1 - \alpha)^{p-1} ; \qquad (4)$$

that is, the probability that one finds no significant differences in
making (p − 1) independent tests, each at protection level $\gamma_{2,\alpha}$, is
$\gamma_{2,\alpha}^{\,p-1}$. Thus Duncan’s new multiple range test is based upon protection
levels $\gamma_{p,\alpha}$ for tests on p means. The critical range SSD is a minimum
(equal to the LSD) for p = 2, and a maximum for p = m.
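
Equation (4) is easy to tabulate. The following Python sketch (the function name is ours) prints the protection level and the corresponding significance level $1 - \gamma_{p,\alpha}$ for p = 2, ..., 5 at $\alpha = 0.05$.

```python
def protection_level(p, alpha):
    """Equation (4): joint protection level for a test on p means."""
    return (1.0 - alpha) ** (p - 1)


for p in range(2, 6):
    gamma = protection_level(p, 0.05)
    print(p, round(gamma, 4), round(1.0 - gamma, 4))
```

The significance level attached to a test on p means therefore grows with p (0.05 for p = 2, about 0.185 for p = 5), which is why Duncan's critical ranges increase more slowly with p than those of the Newman-Keuls test.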

The critical ranges ISD, XSD, and SSD for p out of m ordered means
of samples of size N required for significance (at the $\alpha$ level) by the
Newman-Keuls test, Tukey’s X procedure, and Duncan’s new multiple
range test, respectively, are given by

$$\mathrm{ISD} = q(\alpha, p, \nu)\, s_{\bar{x}} , \qquad (5)$$
$$\mathrm{XSD} = (1/2)[q(\alpha, p, \nu) + q(\alpha, m, \nu)]\, s_{\bar{x}} , \qquad (6)$$
$$\mathrm{SSD} = R(p, \nu, \gamma_{p,\alpha})\, s_{\bar{x}} , \qquad (7)$$

where $R(p, \nu, \gamma_{p,\alpha})$ is the studentized range of p observations with $\nu$
degrees of freedom for s and protection level $\gamma_{p,\alpha}$ given by equation
(4), and where the other symbols are defined as in the previous section.


5. Example 


At this point an example will serve to illustrate the various test 
procedures. For the sake of simplicity, a single-classification experi- 
ment involving comparisons of five means will be considered. The 
data represent the weight loss (grams) in four days of rats on five diets 
(no food and 8 calories per rat per day of four foods). In order to 
reduce experimental error, only male rats were used, all rats were of 
the same strain, and their initial weights were nearly constant. All 
one hundred rats received water ad libitum, but they were randomly 
divided into five groups of twenty rats each, one group on each of the 
five diets. 

An analysis of variance of the data yields the following results: 


Source of Variation        D.f.      S.s.       M.s.       F
Among diets                  4     3012.70    753.18     38.1
Within diets (error)        95     1876.30     19.75

Total                       99     4889.00
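
The mean squares and the F-ratio follow directly from the sums of squares and degrees of freedom in the table. A minimal Python sketch (the function name is ours):

```python
def one_way_anova(ss_among, df_among, ss_error, df_error):
    """Return (mean square among, mean square error, F-ratio)."""
    ms_among = ss_among / df_among
    ms_error = ss_error / df_error
    return ms_among, ms_error, ms_among / ms_error


ms_among, ms_error, f_ratio = one_way_anova(3012.70, 4, 1876.30, 95)
print(round(ms_among, 2), round(ms_error, 2), round(f_ratio, 1))
```

With 4 and 95 degrees of freedom an F-ratio of this size is far beyond the usual 1% point, in line with the statement that the difference among diets is highly significant.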


Thus there is a highly significant difference among mean weight 
losses of the rats on the five diets. This information is quite interesting 
to the experimenter, but what he really wishes to know is which diets 
are significantly better than which others. In order to obtain this
information, it is necessary to apply a multiple comparisons test to the
means:

Purina Laboratory Chow     Casein     Dextrose     Olive Oil     No Food
        23.70               24.90      28.25        29.40         39.25


The standard error of the mean, $s_{\bar{x}}$, and the standard error of the
difference between means, $s_d$, are found to be 0.994 and 1.405,
respectively, based upon the internal estimate of variance, $s^2 = 19.75$,
with 95 degrees of freedom. The factors by which these must be
multiplied to obtain the critical ranges for significance at the 5% and
1% levels can be found in tables. Tables of the critical values of the
Student t distribution can be found in almost any statistics textbook.
The 5% and 1% points of the studentized range have been tabulated
by Pearson and Hartley [1954], and the 5% and 1% points of
$R(p, \nu, \gamma_{p,\alpha})$ by Beyer [1953] and Duncan [1955]. All of these tables
have been reproduced by Federer [1955], who gives the only textbook
treatment of the subject of multiple comparisons of which the author is
aware. The factors by which $s_{\bar{x}} = 0.994$ must be multiplied to obtain the
critical ranges for the various tests are as follows:


Multiplying the standard error of the mean $s_{\bar{x}} = 0.994$ by these factors,
one finds that the critical ranges for the various tests are as follows:


                $\alpha = 0.05$                      $\alpha = 0.01$

Code     p=2    p=3    p=4    p=5        p=2    p=3    p=4    p=5
L        2.79   2.79   2.79   2.79       3.70   3.70   3.70   3.70
S        2.79   2.94   3.04   3.10       3.70   3.85   3.96   4.05
I        2.79   3.35   3.68   3.92       3.70   4.19   4.50   4.71
X        3.36   3.64   3.80   3.92       4.20   4.45   4.61   4.71
W        3.92   3.92   3.92   3.92       4.71   4.71   4.71   4.71
E        4.03   4.03   4.03   4.03       4.77   4.77   4.77   4.77

Then the results of the various tests can be summarized as follows,
where two means are or are not significantly different at the level $\alpha$
according as they are not or are underscored by the same line (each
underscored group is shown here in parentheses):

Test     $\alpha = 0.05$
LSD      (23.70 24.90) (28.25 29.40) 39.25
SSD      (23.70 24.90) (28.25 29.40) 39.25
ISD      (23.70 24.90) (28.25 29.40) 39.25
XSD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
WSD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
ESD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25

Test     $\alpha = 0.01$
LSD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
SSD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
ISD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
XSD      (23.70 24.90) (24.90 28.25) (28.25 29.40) 39.25
WSD      (23.70 24.90 28.25) (24.90 28.25 29.40) 39.25
ESD      (23.70 24.90 28.25) (24.90 28.25 29.40) 39.25


Thus, at the 5% level, the only disagreement is as to whether the 
means for casein and dextrose differ significantly. At the 1% level, 
there are disagreements in regard to the following differences: (1) 
casein and olive oil and (2) Purina laboratory chow and dextrose. For
experiments involving a larger number of means, one would expect a 
greater number of disagreements. 
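
The testing procedure described in the Introduction can be sketched in Python and applied to the five diet means. The function name and data layout are ours; the critical ranges are the 5% values tabulated above (entries for p = 2, 3, 4, 5). The sketch applies the rule that a range contained in a larger non-significant group is automatically non-significant; for the fixed range tests this rule has no effect because their critical range does not depend on p.

```python
MEANS = [23.70, 24.90, 28.25, 29.40, 39.25]   # ordered diet means

# Critical ranges for p = 2, 3, 4, 5 at the 5% level.
CRITICAL_05 = {
    "LSD": [2.79, 2.79, 2.79, 2.79],
    "SSD": [2.79, 2.94, 3.04, 3.10],
    "ISD": [2.79, 3.35, 3.68, 3.92],
    "XSD": [3.36, 3.64, 3.80, 3.92],
    "WSD": [3.92, 3.92, 3.92, 3.92],
    "ESD": [4.03, 4.03, 4.03, 4.03],
}


def significant_pairs(means, crit):
    """Return index pairs (i, j), i < j, whose range is declared significant.

    `means` must be in ascending order; crit[p - 2] is the critical range
    for a group of p adjacent ordered means.
    """
    m = len(means)
    nonsig = set()    # groups (i, j) declared non-significant
    sig = set()       # groups (i, j) declared significant
    for p in range(m, 1, -1):                  # largest groups first
        for i in range(0, m - p + 1):
            j = i + p - 1
            # automatically non-significant inside a non-significant group
            covered = any(lo <= i and j <= hi for lo, hi in nonsig)
            if covered or means[j] - means[i] <= crit[p - 2]:
                nonsig.add((i, j))
            else:
                sig.add((i, j))
    return sig


# Casein (index 1) versus dextrose (index 2), the one disagreement at 5%.
for name, crit in CRITICAL_05.items():
    print(name, (1, 2) in significant_pairs(MEANS, crit))
```

Run on these data, the sketch declares the casein and dextrose means different under the LSD, SSD and ISD rules but not under the XSD, WSD and ESD rules, matching the disagreement noted in the text.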


6. Type I Error Rates 


The Type I error rate, $\alpha$, is defined differently for the various tests.
The Type I error rate (as defined for a particular test) is denoted by
$\alpha$ with the code letter for that test as a subscript; for example, $\alpha_L$ is
the Type I error rate as defined by the LSD test. A comparison of the
error rates is in order.

By comparison of (1) with (2), one sees that a WSD with Type I
error rate $\alpha_W$ is equivalent to an LSD with Type I error rate $\alpha_L$, where

$$t(\alpha_L , \nu) = q(\alpha_L , 2, \nu)/\sqrt{2} = q(\alpha_W , m, \nu)/\sqrt{2} . \qquad (8)$$

Similarly, by comparison of (1) with (3), one sees that an ESD with
Type I error rate $\alpha_E$ is equivalent to an LSD with Type I error rate
$\alpha_L$, where

$$\alpha_L = \alpha_E / C_2^m . \qquad (9)$$
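
Equation (9) involves only the number of pairs of means, so the per-comparison rate equivalent to a given per-family rate can be computed directly. A small Python sketch (the function name is ours):

```python
import math


def alpha_l_given_alpha_e(alpha_e, m):
    """Equation (9): alpha_L = alpha_E / C(m, 2)."""
    return alpha_e / math.comb(m, 2)


for m in (2, 3, 5, 10, 20):
    print(m, f"{alpha_l_given_alpha_e(0.05, m):.5f}")
```

For $\alpha_E = 0.05$ this gives .05000, .01667, .00500, .00111 and .00026 for m = 2, 3, 5, 10 and 20, the values listed in the last column of Table 1A.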


Let $\alpha_L \mid \alpha_M$ be the value of $\alpha_L$ corresponding to a given $\alpha_M$, where
M may be any one of the letters W, E, I, X, or S. Table 1A gives
$\alpha_L \mid \alpha_W$ corresponding to $\alpha_W = 0.05, 0.01$ for m = 2(1)10(2)20, and
N = 2, 3, 4, 6, 10, 16, 25, $\infty$; also $\alpha_L \mid \alpha_E$ corresponding to $\alpha_E = 0.05$,
0.01 for the same values of m. (The relation between $\alpha_L$ and $\alpha_E$ is
independent of N.) This table, as well as others which will follow,
has been computed for a single-classification design, and the results
are slightly different for randomized blocks or other experimental designs,
due to a slight reduction in the number of degrees of freedom.

In testing the significance of the range of p out of m ordered means
of samples of size N by one of the multiple range tests with Type I error
rate $\alpha$ (as defined for that particular test), the critical range is a function
of m, N and p. If $\alpha$, m and N are fixed, the critical range is a function
of p only. Each critical range, for fixed m and N, corresponds to a
particular value of the Type I error rate $\alpha_L$; the larger the critical range,
the smaller the value of $\alpha_L$ to which it corresponds. Since the critical
range is a non-decreasing function of p, the maximum value of $\alpha_L$
corresponding to a fixed $\alpha$ for the multiple range test under consideration,
with m and N also fixed, occurs for p = 2, and the minimum for p = m.

For p = 2, the Newman-Keuls test and Duncan’s new multiple
range test have critical ranges equal to the LSD; hence $(\alpha_L)_{\max} \mid \alpha_I$ is
equal to $\alpha_I$ and $(\alpha_L)_{\max} \mid \alpha_S$ is equal to $\alpha_S$.

For p = m, the Newman-Keuls test and Tukey’s X procedure
have critical ranges equal to the WSD; hence $(\alpha_L)_{\min} \mid \alpha_I$ is equal to
$\alpha_L \mid \alpha_W = \alpha_I$ and $(\alpha_L)_{\min} \mid \alpha_X$ is equal to $\alpha_L \mid \alpha_W = \alpha_X$. The values of
$\alpha_L \mid \alpha_W = 0.05, 0.01$ have been given in Table 1A.

Table 1B gives $(\alpha_L)_{\max} \mid \alpha_X$ corresponding to $\alpha_X = 0.05, 0.01$ and
Table 1C gives $(\alpha_L)_{\min} \mid \alpha_S$ corresponding to $\alpha_S = 0.05, 0.01$, for the
same values of m and N as in Table 1A.


7. Type II and Type III Error Rates 


Let $m_U$ and $m_L$ be the two extreme values of p out of m ordered
means of samples of size N. Let $\mu_U$ and $\mu_L$ be the population means
corresponding to $m_U$ and $m_L$ respectively. Let the subscripts U and
L be so assigned that $\mu_U > \mu_L$. (This does not necessarily imply that
$m_U > m_L$.) It should be noted that, while exactly (p − 2) of the
(m − 2) other sample means lie between $m_U$ and $m_L$, the number of
other population means lying between $\mu_U$ and $\mu_L$ may be less than,
equal to, or greater than (p − 2). Now define $\delta$ and $\delta'$ by the relations

$$\delta = (\mu_U - \mu_L)/\sigma , \qquad (10)$$
$$\delta' = (\mu_U - \mu_L)/s , \qquad (10')$$

where $\sigma$ is the common population standard deviation and s is the
pooled estimate obtained from the data.

If the conclusion is drawn that $m_U$ and $m_L$ are not significantly
different when actually $\mu_U > \mu_L$, this constitutes an error of Type II.
If $m_L$ is found to be significantly greater than $m_U$ when actually $\mu_U > \mu_L$,
this will be called an error of Type III. Figure 1 shows how such an


TABLE 1
COMPARISON OF TYPE I ERROR RATES FOR SINGLE CLASSIFICATION
A. Values of $\alpha_L$ Corresponding to $\alpha_W = 0.05, 0.01$ and to $\alpha_E = 0.05, 0.01$

                         $\alpha_L \mid \alpha_W = 0.05$                              $\alpha_L \mid \alpha_E = 0.05$
 m  |  N=2     N=3     N=4     N=6     N=10    N=16    N=25    N=∞   | (all N's)

 2  | .05000  .05000  .05000  .05000  .05000  .05000  .05000  .05000 | .05000
 3  | .02506  .02197  .02093  .02028  .01964  .01943  .01923  .01911 | .01667
 4  | .01517  .01254  .01170  .01104  .01070  .01040  .01032  .01023 | .00833
 5  | .01022  .00818  .00746  .00699  .00670  .00654  .00646  .00634 | .00500
 6  | .00727  .00569  .00524  .00486  .00465  .00450  .00444  .00438 | .00333
 7  | .00542  .00418  .00382  .00356  .00341  .00331  .00327  .00319 | .00238
 8  | .00418  .00319  .00294  .00272  .00260  .00255  .00249  .00242 | .00179
 9  | .00330  .00251  .00231  .00216  .00202  .00195  .00194  .00191 | .00139
10  | .00269  .00204  .00185  .00173  .00165  .00162  .00161  .00157 | .00111
12  | .00184  .00141  .00129  .00119  .00114  .00112  .00111  .00109 | .00076
14  | .00135  .00103  .00094  .00089  .00085  .00084  .00082  .00080 | .00055
16  | .00103  .00079  .00072  .00067  .00065  .00064  .00063  .00062 | .00042
18  | .00080  .00062  .00057  .00053  .00051  .00050  .00050  .00049 | .00033
20  | .00064  .00050  .00047  .00043  .00041  .00041  .00040  .00040 | .00026

                         $\alpha_L \mid \alpha_W = 0.01$                              $\alpha_L \mid \alpha_E = 0.01$
 m  |  N=2     N=3     N=4     N=6     N=10    N=16    N=25    N=∞   | (all N's)

 2  | .01000  .01000  .01000  .01000  .01000  .01000  .01000  .01000 | .01000
 3  | .00497  .00420  .00397  .00383  .00372  .00364  .00360  .00358 | .00333
 4  | .00291  .00233  .00215  .00201  .00192  .00188  .00187  .00186 | .00167
 5  | .00191  .00146  .00133  .00125  .00119  .00117  .00116  .00114 | .00100
 6  | .00134  .00101  .00092  .00085  .00081  .00080  .00079  .00076 | .00067
 7  | .00098  .00073  .00066  .00062  .00058  .00057  .00057  .00056 | .00048
 8  | .00074  .00055  .00050  .00047  .00044  .00043  .00043  .00042 | .00036
 9  | .00058  .00043  .00039  .00036  .00035  .00034  .00034  .00033 | .00028
10  | .00047  .00034  .00031  .00029  .00028  .00027  .00027  .00026 | .00022
12  | .00031  .00023  .00021  .00020  .00019  .00019  .00018  .00018 | .00015
14  | .00022  .00017  .00016  .00015  .00014  .00014  .00014  .00013 | .00011
16  | .00017  .00013  .00012  .00011  .00011  .00011  .00011  .00010 | .00008
18  | .00013  .00010  .00009  .00009  .00008  .00008  .00008  .00008 | .00007
20  | .00010  .00008  .00007  .00007  .00007  .00007  .00007  .00006 | .00005

error may arise. The sum of the Type II error rate, $\beta$, and the Type
III error rate, $\gamma$, will be denoted by $\beta'$, that is,

$$\beta' = \beta + \gamma . \qquad (11)$$

The Type III error rate, $\gamma$, may be sizeable when $\delta'$ and N are both small.
Otherwise, $\gamma \approx 0$ and $\beta' \approx \beta$.


TABLE 1 (continued)
B. Maximum Values of $\alpha_L$ Corresponding to $\alpha_X = 0.05, 0.01$

                    $(\alpha_L)_{\max} \mid \alpha_X = 0.05$
 m     N=2     N=3     N=4     N=6     N=10    N=16    N=25    N=∞

 2    .0500   .0500   .0500   .0500   .0500   .0500   .0500   .0500
 3    .0348   .0329   .0323   .0321   .0316   .0316   .0314   .0314
 4    .0266   .0249   .0244   .0240   .0238   .0236   .0235   .0235
 5    .0217   .0202   .0197   .0194   .0192   .0191   .0191   .0191
 6    .0183   .0170   .0167   .0164   .0163   .0162   .0162   .0162
 7    .0158   .0148   .0145   .0143   .0142   .0142   .0142   .0142
 8    .0140   .0130   .0129   .0127   .0127   .0126   .0126   .0126
 9    .0125   .0117   .0116   .0115   .0114   .0113   .0113   .0114
10    .0114   .0107   .0105   .0104   .0104   .0104   .0104   .0104
12    .0096   .0091   .0090   .0090   .0089   .0089   .0089   .0090
14    .0084   .0080   .0079   .0079   .0079   .0079   .0079   .0079
16    .0074   .0071   .0071   .0071   .0071   .0071   .0071   .0071
18    .0067   .0065   .0064   .0064   .0064   .0064   .0065   .0065
20    .0061   .0059   .0059   .0059   .0059   .0059   .0059   .0059

      $(\alpha_L)_{\max} \mid \alpha_X = 0.01$
 m     N=4     N=6     N=10

 2    .0100   .0100   .0100
 3    .0062   .0062   .0061
 4    .0046   .0045   .0045
 5    .0036   .0036   .0036
 6    .0031   .0030   .0030
 7    .0026   .0026   .0026
 8    .0023   .0023   .0023
 9    .0021   .0021   .0021
10    .0019   .0019   .0019
12    .0016   .0016   .0016
14    .0014   .0014   .0014
16    .0012   .0012   .0012
18    .0011   .0011   .0011
20    .0010   .0010   .0010
TABLE 1 (continued)
C. Minimum Values of $\alpha_L$ Corresponding to $\alpha_S$ = 0.05, 0.01


$(\alpha_L)_{\min} \mid \alpha_S = 0.05$

 m     N=2     N=3     N=4     N=6     N=10    N=16    N=25    N=∞

 2    .0500   .0500   .0500   .0500   .0500   .0500   .0500   .0500
 3    .0500   .0446   .0425   .0411   .0400   .0394   .0392   .0389
 4    .0467   .0397   .0364   .0359   .0344   .0333   .0331   .0327
 5    .0424   .0357   .0334   .0319   .0305   .0299   .0295   .0289
 6    .0406   .0333   .0305   .0287   .0275   .0268   .0265   .0259
 7    .0379   .0310   .0283   .0266   .0255   .0249   .0246   .0241
 8    .0360   .0291   .0266   .0247   .0238   .0231   .0228   .0224
 9    .0345   .0276   .0251   .0232   .0222   .0218   .0216   .0212
10    .0340   .0260   .0237   .0222   .0212   .0207   .0205   .0200
12    .0308   .0239   .0217   .0203   .0193   .0189   .0186   .0182
14    .0282   .0221   .0201   .0189   .0180   .0175   .0173   .0168
16    .0263   .0208   .0189   .0177   .0169   .0165   .0163   .0159
18    .0245   .0194   .0179   .0166   .0160   .0156   .0154   .0150
20    .0234   .0186   .0170   .0159   .0151   .0147   .0145   .0141


$(\alpha_L)_{\min} \mid \alpha_S = 0.01$

 m     N=2     N=3     N=4     N=6     N=10    N=16    N=25    N=∞

 2    .0100   .0100   .0100   .0100   .0100   .0100   .0100   .0100
 3    .0092   .0080   .0075   .0075   .0074   .0074   .0074   .0072
 4    .0082   .0066   .0062   .0061   .0061   .0060   .0059   .0058
 5    .0072   .0057   .0055   .0053   .0052   .0050   .0050   .0049
 6    .0063   .0050   .0049   .0047   .0046   .0045   .0044   .0043
 7    .0054   .0045   .0044   .0042   .0040   .0040   .0039   .0038
 8    .0048   .0042   .0041   .0038   .0036   .0036   .0035   .0034
 9    .0043   .0039   .0038   .0036   .0034   .0033   .0033   .0032
10    .0039   .0037   .0035   .0033   .0031   .0030   .0030   .0030
12    .0035   .0033   .0031   .0029   .0027   .0027   .0026   .0026
14    .0033   .0029   .0028   .0026   .0025   .0024   .0024   .0023
16    .0031   .0027   .0025   .0024   .0023   .0022   .0022   .0021
18    .0030   .0025   .0024   .0022   .0021   .0021   .0020   .0019
20    .0029   .0024   .0022   .0020   .0019   .0019   .0019   .0018


FIGURE 1
GRAPHICAL REPRESENTATION OF THREE TYPES OF ERROR


The code letters for the various tests are used as subscripts on $\beta$
and $\beta'$; for example, $\beta_L$ denotes the Type II error rate for the LSD test.

The Type II and Type III error rates, $\beta$ and $\gamma$, and their sum, $\beta'$,
depend upon $\alpha$, m, N and $\delta'$. In the case of multiple range tests, they
also depend upon p; for a fixed combination of $\alpha$, m, N and $\delta'$, $\beta'$ is a
minimum for p = 2 and a maximum for p = m. For p = 2, the New-
man-Keuls test and Duncan’s new multiple range test have critical
ranges equal to the LSD; hence $(\beta'_K)_{\min} = (\beta'_S)_{\min} = \beta'_L$. For p = m,
the Newman-Keuls test and Tukey’s X procedure have critical ranges
equal to the WSD; hence $(\beta'_K)_{\max} = (\beta'_X)_{\max} = \beta'_W$. Table 2 gives values
of $\beta'_L$, $(\beta'_S)_{\max}$, $(\beta'_X)_{\min}$, and $\beta'_W$. For convenience, these are denoted
by $\beta'_1$, $\beta'_2$, $\beta'_3$, and $\beta'_4$, respectively. Thus one has the following equa-
tions and inequalities:

$$\beta'_L = \beta'_1, \tag{12}$$
$$\beta'_1 \leq \beta'_S \leq \beta'_2, \tag{13}$$
$$\beta'_1 \leq \beta'_K \leq \beta'_4, \tag{14}$$
$$\beta'_3 \leq \beta'_X \leq \beta'_4, \tag{15}$$
$$\beta'_W = \beta'_4. \tag{16}$$

In (13), (14), and (15), the left-hand equality holds when p = 2 and
the right-hand equality when p = m. Error rates, $\beta'_F$, for Fisher’s
test have not been computed for m > 2, but they are even larger than
the corresponding $\beta'_W$. Hence one has the inequality

$$\beta'_F \geq \beta'_4, \tag{17}$$

where the equality holds only for m = 2, in which case all six tests are
identical.
The values in Table 2, like those in Table 1, were computed for the
single-classification case, and the values for randomized blocks and
other designs are slightly different.

No attempt has been made to specify in the hypothesis under test
the values or relative values of population means other than $\mu_1$ and
$\mu_2$. This does not affect the fixed range tests, and leads to no error in
the maximum values of $\beta'$, but produces an upward bias in minimum
values of $\beta'$ for the multiple range tests. This is true since even if the
range of two adjacent sample means is significant, the two will be said 
to differ significantly only if the ranges of all groups containing them 
are significant. Wine [1955] has made a power study of multiple range 
tests in which the relative values of all population means are specified. 
He tabulated the power for tests on 3 and 4 means. For larger values 
of m, his approach appears forbiddingly difficult, and the present 
approach, though not entirely satisfactory, seems to be the only feasible 
one. 


8. Required Sample Sizes 


If $\alpha$, $\beta$, $\delta'$ and m are fixed, the required sample size N for each of the
fixed range tests can be determined. For multiple range tests, the
required sample size N depends upon p. If $\alpha$, $\beta$, $\delta'$ and m are fixed,
the minimum required sample size $N_{\min}$ occurs for p = 2, and the
maximum required sample size $N_{\max}$ occurs for p = m.

The code letters for the various tests are used as subscripts on the
required sample size N; for example, $N_L$ denotes the required sample
size for the LSD test.

For p = 2, the critical ranges for the Newman-Keuls test and
Duncan’s new multiple range test are equal to the LSD; hence
$(N_K)_{\min} = (N_S)_{\min} = N_L$. For p = m, the critical ranges for the
Newman-Keuls test and Tukey’s X procedure are equal to the WSD;
hence $(N_K)_{\max} = (N_X)_{\max} = N_W$.

Table 3 gives $N_L$, $(N_S)_{\max}$, $(N_X)_{\min}$, and $N_W$ for $\alpha$ = 0.05, 0.01;
$\beta$ = 0.10, 0.05; m = 2(1)10(2)20; $\delta'$ = 0.5(0.5)3.0. The required
sample sizes $N_F$ for Fisher’s test have not been computed, but they are
slightly larger than the corresponding $N_W$.

For small values of $\delta'$, the required sample size N is approximately
inversely proportional to $\delta'^2$. Hence, for $\delta' < 1$, the approximate
value of $N\delta'^2$ may be read from the column headed $\delta'$ = 1.0 in Table 3.

The values in Table 3 have been computed for the single-classifica-
tion case. To obtain sample sizes which will ensure that $\alpha$ and $\beta$ do
not exceed the prescribed values for randomized blocks, it can be
shown that it is sufficient (though not always necessary) to add 2 to
the sample sizes for m = 2 and to add 1 for m ≥ 3.
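
The two rules of thumb in this section can be sketched in a few lines of Python. This is only an illustration, not part of the original computations: the function names are hypothetical, the first function assumes the relation $N\delta'^2 \approx$ constant for $\delta' < 1$, and treating m = 3 under the "add 1" case is an assumption about the intended reading.

```python
import math


def approx_n_small_delta(n_at_delta_1: int, delta: float) -> int:
    """Approximate N for 0 < delta' < 1 from the tabled N at delta' = 1.0.

    Uses N * delta'^2 ~ constant and rounds up, since N must be an integer.
    """
    if not 0 < delta < 1:
        raise ValueError("the approximation is stated only for 0 < delta' < 1")
    return math.ceil(n_at_delta_1 / delta ** 2)


def adjust_for_randomized_blocks(n_single: int, m: int) -> int:
    """Sufficient (not always necessary) N for randomized blocks.

    Adds 2 to the single-classification N when m = 2, and 1 otherwise.
    """
    if m < 2:
        raise ValueError("at least two means are needed")
    return n_single + (2 if m == 2 else 1)


# Tabled N_L for alpha = 0.05, beta = 0.10, m = 2, delta' = 1.0 is 23.
rough_n = approx_n_small_delta(23, 0.5)
blocked_n = adjust_for_randomized_blocks(23, 2)
```

Here `rough_n` is 92, somewhat above the tabled value of 86 for $\delta'$ = 0.5, which shows that the inverse-square relation is only approximate at that point.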


TABLE 3

SAMPLE SIZE N FOR PRESCRIBED $\alpha$ AND $\beta$ [$\delta'$ = 0.5 (0.5) 3.0]
(SINGLE CLASSIFICATION)

A. $\alpha$ = 0.05, $\beta$ = 0.10 (P = 0.90)

                  m     δ'=0.5  δ'=1.0  δ'=1.5  δ'=2.0  δ'=2.5  δ'=3.0

$N_L$             2       86      23      11       7       5       4
                  3       85      22      10       7       5       4
                  4       85      22      10       6       4       4
                  5-20    85      22      10       6       4       3

$(N_S)_{\max}$    2       86      23      11       7       5       4
                  3       91      24      11       7       5       4
                  4       94      24      11       7       5       4
                  5       97      25      12       7       5       4
                  6       99      25      12       7       5       4
                  7      101      26      12       7       5       4
                  8      102      26      12       7       5       4
                  9      104      26      12       7       5       4
                 10      105      27      12       7       5       4
                 12      107      27      12       7       5       4
                 14      108      28      13       7       5       4
                 16      110      28      13       7       5       4
                 18      111      28      13       7       5       4
                 20      112      28      13       8       5       4

$(N_X)_{\min}$    2       86      23      11       7       5       4
                  3       96      25      12       7       5       4
                  4      102      26      12       7       5       4
                  5      106      27      13       8       5       4
                  6      110      28      13       8       5       4
                  7      113      29      13       8       5       4
                  8      115      29      14       8       5       4
                  9      117      30      14       8       6       4
                 10      119      30      14       8       6       4
                 12      122      31      14       8       6       4
                 14      125      32      15       9       6       4
                 16      127      32      15       9       6       4
                 18      129      33      15       9       6       4
                 20      131      33      15       9       6       4

$N_W$             2       86      23      11       7       5       4
                  3      106      28      13       8       6       4
                  4      120      31      14       9       6       5
                  5      130      33      15       9       6       5
                  6      138      35      16      10       7       5
                  7      144      37      17      10       7       5
                  8      150      38      17      10       7       5
                  9      155      39      18      11       7       5
                 10      159      40      18      11       7       5
                 12      166      42      19      11       7       5
                 14      173      44      20      12       8       6
                 16      178      45      20      12       8       6
                 18      183      46      21      12       8       6
                 20      187      47      21      12       8       6


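TABLE 3 (continued)
B. $\alpha$ = 0.05, $\beta$ = 0.05 (P = 0.95)

[The body of this table is largely illegible. Row groups: $N_L$,
$(N_S)_{\max}$, $(N_X)_{\min}$, $N_W$. One complete column of 14 entries
survives, apparently $N_W$ at $\delta'$ = 0.5 for m = 2(1)10(2)20:]

106  129  143  154  163  170  176  181  186  194  201  206  211  216

[Isolated legible entries: 142, 146, 149, 155.]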
TABLE 3 (continued)
C. $\alpha$ = 0.01, $\beta$ = 0.10 (P = 0.90)

[Only three of the six $\delta'$ columns are legible, apparently
$\delta'$ = 1.0, 1.5, 2.0. The m groupings of the $N_L$ rows are not legible.]

                  m     δ'=1.0  δ'=1.5  δ'=2.0

$N_L$                     32      15      10
                          31      15       9
                          31      15       9
                          31      14       9
                          31      14       8
                          30      14       8
                          30      14       8

$(N_S)_{\max}$    2       32      15      10
                  3       33      16      10
                  4       34      16      10
                  5       35      16      10
                  6       35      16      10
                  7       36      16      10
                  8       36      17      10
                  9       37      17      10
                 10       37      17      10
                 12       38      17      10
                 14       38      17      10
                 16       39      18      10
                 18       39      18      10
                 20       39      18      10

$(N_X)_{\min}$    2       32      15      10
                  3       34      16      10
                  4       35      17      10
                  5       36      17      10
                  6       37      17      10
                  7       38      17      10
                  8       39      18      10
                  9       39      18      11
                 10       40      18      11
                 12       40      18      11
                 14       41      19      11
                 16       41      19      11
                 18       42      19      11
                 20       42      19      11

$N_W$             2       32      15      10
                  3       37      17      11
                  4       40      19      11
                  5       43      20      12
                  6       44      20      12
                  7       46      21      12
                  8       47      22      13
                  9       49      22      13
                 10       50      23      13
                 12       51      23      14
                 14       53      24      14
                 16       54      25      14
                 18       55      25      14
                 20       56      26      15
TABLE 3 (continued)
D. $\alpha$ = 0.01, $\beta$ = 0.05 (P = 0.95)

                  m     δ'=0.5  δ'=1.0  δ'=1.5  δ'=2.0  δ'=2.5  δ'=3.0

$N_L$             2      145      38      18      11       8       6
                  3      144      37      17      11       7       6
                  4-7    144      37      17      10       7       5
                  8      143      37      17      10       7       5
                  9-12   143      36      17      10       7       5
                 13-19   143      36      17      10       6       5
                 20      143      36      16      10       6       5

$(N_S)_{\max}$    2      145      38      18      11       8       6
                  3      151      39      18      11       8       6
                  4      156      40      19      11       8       6
                  5      160      41      19      11       8       6
                  6      163      42      19      11       8       6
                  7      166      42      19      11       8       6
                  8      168      43      19      11       8       6
                  9      170      43      20      11       8       6
                 10      171      43      20      12       8       6
                 12      174      44      20      12       8       6
                 14      177      45      20      12       8       6
                 16      179      45      20      12       8       6
                 18      181      46      21      12       8       6
                 20      182      46      21      12       8       6

$(N_X)_{\min}$    2      145      38      18      11       8       6
                  3      156      40      19      11       8       6
                  4      163      42      19      12       8       6
                  5      168      43      20      12       8       6
                  6      172      44      20      12       8       6
                  7      175      44      20      12       8       6
                  8      17?      45      21      12       8       6
                  9      180      46      21      12       8       6
                 10      182      46      21      12       8       6
                 12      185      47      21      12       8       6
                 14      188      48      22      13       8       6
                 16      191      48      22      13       8       6
                 18      193      49      22      13       8       6
                 20      195      49      22      13       8       6

$N_W$             2      145      38      18      11       8       6
                  3      168      43      20      12       8       6
                  4      183      47      22      13       9       7
                  5      193      49      23      13       9       7
                  6      202      51      24      14       9       7
                  7      209      53      24      14      10       7
                  8      215      55      25      15      10       7
                  9      220      56      25      15      10       7
                 10      225      57      26      15      10       7
                 12      233      59      27      15      10       7
                 14      240      61      27      16      10       8
                 16      245      62      28      16      11       8
                 18      250      63      29      16      11       8
                 20      255      64      29      17      11       8
9. Computation of the Tables


Let the critical range MSD of a range test on p out of m ordered
means with significance level $\alpha_M$ be represented by

$$\mathrm{MSD} = Q(\alpha_M, p, m, \nu)\, s_{\bar{x}}, \tag{18}$$

where M may be K, X, S, or W and $Q(\alpha_M, p, m, \nu)$ is the significant
studentized range at the $\alpha_M$ level for a test on p out of m ordered means
with $\nu$ degrees of freedom for s. The equivalent LSD test has sig-
nificance level $\alpha_L$ such that

$$t(\alpha_L, \nu) = Q(\alpha_M, p, m, \nu)/\sqrt{2}. \tag{19}$$

It should be noted that (19) is a generalization of (8). While (8) holds
only for Tukey’s studentized range test, (19) holds for that test and also
for the multiple range tests. The part of Table 1 concerned with $\alpha_L$
corresponding to $\alpha_S$ = 0.05, 0.01 is computed directly from (9). The
remainder of Table 1 is computed from (19). One knows $Q(\alpha_M, p, m, \nu)$
and must find $\alpha_L$ by interpolation from the probability integral of the
t-distribution, which has been tabulated by Hartley and Pearson
[1950]. The method of interpolation is linear harmonic $\nu$-wise and three-
point Lagrangian t-wise. For $\nu > 20$, $t > 4$, the Hartley-Pearson tables
cannot be used. With $t > 4$, tables of the incomplete Beta-function
edited by Karl Pearson [1934] were used for $\nu$ = 21 and for even values
of $\nu$ up to and including 100. The probability that t exceeds $t_0$ is given by

$$P(t > t_0) = \tfrac{1}{2} I_x(\nu/2, 1/2), \tag{20}$$

where

$$x = \nu/(\nu + t_0^2). \tag{21}$$

In one case ($\nu$ = 27, $t > 4$), neither of the above methods is applicable,
but there a method proposed by Fettis [1954] can be used.

The combined Type II and Type III error rate, $\beta'_M$, is given by the
relation

$$\beta'_M = P\{t < [Q(\alpha_M, p, m, \nu) - \delta'\sqrt{N}]/\sqrt{2}\}. \tag{22}$$

In computing Table 2, one knows $\alpha_M$, m, N, p and $\delta'$, and hence also
the value below which t must lie. The corresponding probability is
found by use of the Hartley and Pearson [1950] tables of the probability
integral of the t-distribution or one of the other methods mentioned
above. It would perhaps be better from a theoretical point of view to
fix $\delta$ rather than $\delta'$, but then in computing tables it would be necessary
to use the non-central t-distribution. Available tables of this distri-
bution, including those published recently by Resnikoff and Lieberman
[1957], are such that this does not appear feasible.

Table 3 is also computed from equation (22) by one of the above
methods. Here $\alpha_M$, $\beta'_M$, m, p and $\delta'$ are known, and N is to be de-
termined. Since only integral values of N are possible, the value of
N found by solving equation (22) must be rounded to the next higher
integer.
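
As an illustration of how equation (22) can be solved for N, the following Python sketch treats the LSD test only, where by (19) with p = 2 the quantity $Q/\sqrt{2}$ is simply the two-sided t critical value. It replaces table interpolation with numerical integration of the t density, which is a substitution and not the method used for the published tables. The function names are hypothetical, and the error degrees of freedom $\nu = m(N - 1)$ for the single classification is an assumption stated here rather than taken from the text.

```python
import math


def t_pdf(t, nu):
    """Density of Student's t with nu degrees of freedom."""
    log_c = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi))
    return math.exp(log_c - (nu + 1) / 2 * math.log1p(t * t / nu))


def t_cdf(x, nu, steps=400):
    """P{t <= x}, by Simpson's rule on [0, |x|] and symmetry about zero."""
    a = abs(x)
    if a == 0.0:
        return 0.5
    h = a / steps
    total = t_pdf(0.0, nu) + t_pdf(a, nu)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * t_pdf(k * h, nu)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area


def t_upper_point(tail, nu):
    """Value t0 with P{t > t0} = tail, found by bisection on [0, 100]."""
    lo, hi = 0.0, 100.0
    for _ in range(50):
        mid = (lo + hi) / 2
        if 1.0 - t_cdf(mid, nu) > tail:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def required_n_lsd(alpha, beta, m, delta, n_max=1000):
    """Smallest N with beta' <= beta for the LSD test, from equation (22)."""
    for n in range(2, n_max + 1):
        nu = m * (n - 1)                     # assumed error d.f.
        t_crit = t_upper_point(alpha / 2, nu)  # equals Q / sqrt(2) when p = 2
        bound = t_crit - delta * math.sqrt(n / 2)
        if t_cdf(bound, nu) <= beta:
            return n
    raise ValueError("no N up to n_max satisfies the requirement")
```

For example, `required_n_lsd(0.05, 0.10, 2, 2.0)` returns 7, in agreement with the entry for $N_L$ at m = 2, $\delta'$ = 2.0 in Table 3A. Searching upward from N = 2 gives the "next higher integer" directly, without first solving for a fractional N.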
127 NOTE: SOME SIMPLE METHODS OF COMPUTING
PARAMETERS IN THE ANALYSIS OF VARIANCE


WILLIAM G. MADOW
Stanford Research Institute, Menlo Park, California, U. S. A.


1. Introduction 


The analysis of variance and F-distributions depend on two param- 
eters, the degrees of freedom, f, of the numerator and denominator 
mean squares. 

Tang’s distribution depends not only on the degrees of freedom
but also on a parameter, $\lambda$, that determines the power of the analysis
of variance test.

The purpose of this note is to present some methods of computing
both f and $\lambda$.


2. Computing f. 

We begin by recalling that the sums of squares that occur in the 
analysis of variance are quadratic forms, and that the number of degrees 
of freedom of the sum of squares is the rank of the quadratic form. 

For example, upon expanding the square, we see that

$$S^2 = \sum_{i=1}^{n} (y_i - \bar{y})^2 = \sum_{i,j=1}^{n} c_{ij} y_i y_j = q,$$

where $c_{ii} = 1 - 1/n$ and $c_{ij} = c_{ji} = -1/n$ $(i \neq j)$; and $\bar{y}$ is the
arithmetic mean of $y_1, \ldots, y_n$. The number of degrees of freedom of
$S^2$ is $n - 1$, which is also the rank of q.

Now the rank of a quadratic form does not depend on the statistical
assumptions that may be made. Consequently, we shall assume, in
order to compute f, that the underlying random variables are normally
and independently distributed with 0 mean and constant variance $\sigma^2$.

We shall also assume that $q/\sigma^2$ is known to have the $\chi^2$-distribution.
Various methods of obtaining analysis of variance ratios guarantee
this; for example, the likelihood ratio test of the general linear hypoth-
esis.

Furthermore, if $q/\sigma^2$ has the $\chi^2$-distribution then

$$E(q/\sigma^2) = f,$$

where f is the number of degrees of freedom or rank of q. Thus one
way to compute the degrees of freedom is to calculate the expected
value of q.


Rule 1. Compute the expected value of q under the hypotheses stated
above. Then

$$E(q) = f\sigma^2,$$

where f is the number of degrees of freedom.

For example, $E \sum_{i=1}^{n} (y_i - \bar{y})^2 = n\sigma^2 - \sigma^2 = (n - 1)\sigma^2$ and hence
$f = n - 1$.

But rule 1 can be simplified. Since

$$q = \sum_{i,j=1}^{n} c_{ij} y_i y_j,$$

it follows that if the uncorrelated random variables $y_1, \ldots, y_n$ have
0 means and constant variance $\sigma^2$, then

$$E q = \sigma^2 \sum_{i=1}^{n} c_{ii},$$

and hence $f = \sum_{i=1}^{n} c_{ii}$ (the trace of q).


Rule 2. Determine $c_{ii}$ by putting $y_i = 1$ and $y_j = 0$, $j \neq i$, in q. (Take
advantage of symmetry in doing so.) Thus, in

$$\sum_{i=1}^{n} (y_i - \bar{y})^2, \qquad c_{ii} = \left(1 - \frac{1}{n}\right)^2 + \frac{n - 1}{n^2} = \frac{n - 1}{n},$$

and $f = n - 1$.
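
Rule 2 lends itself to a direct numerical check. The sketch below is an illustration only, with hypothetical function names: it evaluates a sum of squares at each unit vector to obtain $c_{ii}$ and adds the results. It assumes the function passed in really is a quadratic form in the observations.

```python
def degrees_of_freedom(q, n):
    """Rule 2: f = sum of c_ii, where c_ii is q evaluated at the i-th unit vector."""
    total = 0.0
    for i in range(n):
        y = [0.0] * n
        y[i] = 1.0          # y_i = 1, all other y_j = 0
        total += q(y)
    return total


def ss_about_mean(y):
    """Sum of squared deviations from the arithmetic mean."""
    mean = sum(y) / len(y)
    return sum((v - mean) ** 2 for v in y)


def interaction_ss(x, r, s):
    """Two-way interaction sum of squares for r*s values stored row by row."""
    rows = [x[i * s:(i + 1) * s] for i in range(r)]
    row_means = [sum(row) / s for row in rows]
    col_means = [sum(rows[i][j] for i in range(r)) / r for j in range(s)]
    grand = sum(x) / (r * s)
    return sum((rows[i][j] - row_means[i] - col_means[j] + grand) ** 2
               for i in range(r) for j in range(s))


f_simple = degrees_of_freedom(ss_about_mean, 5)
f_inter = degrees_of_freedom(lambda y: interaction_ss(y, 3, 4), 12)
```

Up to rounding error, `f_simple` is 4 (that is, n - 1 with n = 5) and `f_inter` is 6 (that is, (r - 1)(s - 1) with r = 3, s = 4). No expected values are needed, which is the point of the rule.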


For rule 2 one need not evaluate any expected values. But for 
many quadratic forms the work of computing f can be further reduced 
by using two simple theorems on expected values. 


(1) If $y_1, \ldots, y_n$ are uncorrelated, and have common mean and
variance $\sigma_y^2$, then

$$\sigma^2_{y_i - \bar{y}} = E(y_i - \bar{y})^2 = \sigma_y^2 - \sigma_{\bar{y}}^2 = \frac{n - 1}{n}\,\sigma_y^2.$$

(2) Under the same assumptions as (1) we have

$$\sigma_{\bar{y}}^2 = \frac{1}{n}\,\sigma_y^2.$$

It is easy to prove either (1) or (2); the other is an immediate conse-
quence.

Now let us illustrate the method. Let a dot, ($\cdot$), indicate the mean
for the subscript and let

$$q = \sum_{i=1}^{r} \sum_{j=1}^{s} (x_{ij} - x_{i\cdot} - x_{\cdot j} + x_{\cdot\cdot})^2.$$

Since the expected value of a sum of random variables is the sum of
their expected values, we need to evaluate

$$A_{ij} = E(x_{ij} - x_{i\cdot} - x_{\cdot j} + x_{\cdot\cdot})^2.$$

Now

$$A_{ij} = \frac{r - 1}{r}\, E(x_{ij} - x_{i\cdot})^2, \quad \text{by (1) since } x_{\cdot j} - x_{\cdot\cdot} = \frac{1}{r} \sum_{i=1}^{r} (x_{ij} - x_{i\cdot}),$$

$$\phantom{A_{ij}} = \frac{r - 1}{r}\,\frac{s - 1}{s}\,\sigma_x^2, \quad \text{by (1) since } x_{i\cdot} = \frac{1}{s} \sum_{j=1}^{s} x_{ij}.$$

Hence

$$E q = (r - 1)(s - 1)\sigma_x^2,$$

and

$$f = (r - 1)(s - 1).$$

The approach embodied in the above example can be used on many
sums of squares especially including those occurring in factorial designs.


Rule 3. If q is a quadratic form which can be written

$$q = \sum_{i=1}^{r} (y_i - \bar{y})^2,$$

where $y_1, \ldots, y_r$ are uncorrelated, have common means, and common
variance, $\sigma_y^2$, then

$$E(q) = (r - 1)\sigma_y^2,$$

and, if $y_i = \xi_{ij} - \xi_{i\cdot}$, where $\xi_{i\cdot}$ is the mean of s uncorrelated random
variables $\xi_{i1}, \xi_{i2}, \ldots, \xi_{is}$, which have 0 means and common variance
$\sigma_\xi^2$, then (on summing over j)

$$E(q) = (r - 1)(s - 1)\sigma_\xi^2,$$

and, if the process can be continued on $\xi_{ij}$,

$$E(q) = (r - 1)(s - 1)(t - 1)\sigma_\eta^2,$$

and so on.
and so on. 


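3. Computing $\lambda$. It is easy to verify that, if $E(y_i) = a_i$, then

$$E(q) = \sigma^2 f + \sum_{i,j=1}^{n} c_{ij} a_i a_j,$$

and it has earlier [1] been shown that

$$\lambda = \sum_{i,j=1}^{n} c_{ij} a_i a_j.$$

Thus, $\lambda$ is the same quadratic form in $a_1, \ldots, a_n$ that q is in
$y_1, \ldots, y_n$.


Rule 4. If $E(y_i) = a_i$ instead of 0, $\lambda$ is determined by replacing $y_i$ by
$a_i$ in $S^2$, where $S^2$ is the sum of squares that becomes q when the squares
are expanded.

Suppose, for example, that $E(y_i) = a_i$. Then if

$$S^2 = \sum_{i} (y_i - \bar{y})^2$$

we have

$$\lambda = \sum_{i} (a_i - \bar{a})^2.$$

If

$$S^2 = \sum_{i} \sum_{j} (x_{ij} - x_{i\cdot} - x_{\cdot j} + x_{\cdot\cdot})^2$$

and $E(x_{ij}) = a_i + b_j$, then

$$\lambda = \sum_{i} \sum_{j} [(a_i + b_j) - (a_i + \bar{b}) - (\bar{a} + b_j) + (\bar{a} + \bar{b})]^2 = 0.$$

But if $E(x_{ij}) = a_{ij}$, then

$$\lambda = \sum_{i} \sum_{j} (a_{ij} - a_{i\cdot} - a_{\cdot j} + a_{\cdot\cdot})^2.$$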
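
Rule 4 can be illustrated numerically by feeding a table of means, rather than observations, into the same sum of squares. The Python sketch below uses hypothetical function names and made-up means chosen only for illustration; it shows that additive means give $\lambda = 0$ for the interaction sum of squares, while non-additive means do not.

```python
def interaction_ss(x):
    """Two-way interaction sum of squares of a rectangular table x[i][j]."""
    r, s = len(x), len(x[0])
    row_means = [sum(row) / s for row in x]
    col_means = [sum(x[i][j] for i in range(r)) / r for j in range(s)]
    grand = sum(sum(row) for row in x) / (r * s)
    return sum((x[i][j] - row_means[i] - col_means[j] + grand) ** 2
               for i in range(r) for j in range(s))


# Hypothetical row and column effects, used only for this illustration.
a = [1.0, 2.5, 4.0]
b = [0.0, 0.5, -1.0, 2.0]

additive_means = [[ai + bj for bj in b] for ai in a]      # E(x_ij) = a_i + b_j
product_means = [[ai * bj for bj in b] for ai in a]       # a non-additive case

lam_additive = interaction_ss(additive_means)
lam_product = interaction_ss(product_means)
```

Here `lam_additive` vanishes up to rounding error. For the product means the interaction term factorizes as $(a_i - \bar{a})(b_j - \bar{b})$, so `lam_product` equals $4.5 \times 4.6875 = 21.09375$.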
REFERENCE 


[1] W. G. Madow [1940]. The distribution of quadratic forms in non-central normal 
random variables. Ann. Math. Stat., 11: 100-104. 
128 NOTE: A ROUTINE FOR COMPUTING THE DEGREES OF
FREEDOM IN ANALYSIS OF VARIANCE


BERTIL MATÉRN
Forest Research Institute, Stockholm 51, Sweden


This note describes a simple method of automatically computing 
the degrees of freedom along with the sums of squares in an analysis 
of variance. The procedure seems to be useful in routine compu- 
tational work. It may be known to many statisticians, but seems 
not to be described in any textbook. 

A few, rather obvious, remarks will suffice to justify the procedure. 
We consider a number of observations $y_1, y_2, \ldots$. We assume that
we can write $y_i = m_i + e_i$, where the “errors” $\{e_i\}$ are normally and
independently distributed around zero with constant variance V.

By $z_1, z_2, \ldots$, we denote linear forms in the y’s with error variance
V. Thus every z is of the form $\sum a_i y_i$ with $\sum a_i^2 = 1$. In standard
applications we start by computing certain sums $P_1, P_2, \ldots$, where
every P is a sum of squares of some z’s. We then form sums of squares
attributed to different sources of variation by calculating expressions
of the type $Q = \sum g_i P_i$, where the g’s are constants (positive, negative,
or zero). In all regular cases Q/V has a chi-square distribution under
the additional hypothesis that the source of variation in question does
not affect the m’s. The number of degrees of freedom associated with
Q then equals the expectation of Q/V. From the definitions of $\{P_i\}$
and $\{z_i\}$, it is seen that this expectation is $q = \sum g_i p_i$, where $p_i$ is
the number of squared z’s in $P_i$. Thus, Q and q are obtained by the
same linear operation carried out on the P’s and p’s, respectively.
Therefore, Q and q can conveniently be computed in parallel.


TABLE 1
EXPERIMENTAL OBSERVATIONS (FICTITIOUS)

Treatment     Block I    Block II    Sum
$A_0B_0$        2.7        2.9       5.6
$A_0B_1$        2.0        3.6       5.6
$A_1B_0$        3.1        3.3       6.4
$A_1B_1$        3.8        2.7       6.5
$A_2B_0$        2.5        4.3       6.8
$A_2B_1$        4.3        4.0       8.3
Sum            18.4       20.8      39.2

Treatment totals:
$A_0 = 11.2$, $A_1 = 12.9$, $A_2 = 15.1$, $B_0 = 18.8$, $B_1 = 20.4$


TABLE 2
CALCULATION OF $\{P_i\}$ AND $\{p_i\}$

Grouping                        Sum of squares                                       Number of squares

(1) Individual observations     $P_1 = 2.7^2 + \cdots + 4.0^2$          = 134.1200    $p_1 = 12$
(2) AB                          $P_2 = (5.6^2 + \cdots + 8.3^2)/2$      = 130.5300    $p_2 = 6$
(3) A                           $P_3 = (11.2^2 + 12.9^2 + 15.1^2)/4$    = 129.9650    $p_3 = 3$
(4) B                           $P_4 = (18.8^2 + 20.4^2)/6$             = 128.2667    $p_4 = 2$
(5) Blocks                      $P_5 = (18.4^2 + 20.8^2)/6$             = 128.5333    $p_5 = 2$
(6) Total                       $P_6 = 39.2^2/12$                       = 128.0533    $p_6 = 1$


TABLE 3
ANALYSIS OF VARIANCE

Computation, from          Source of
rows of Table 2            variation      s.s.     d.f.     m.s.

(5) - (6)                  Blocks        0.4800     1      0.4800
(3) - (6)                  A             1.9117     2      0.9558
(4) - (6)                  B             0.2134     1      0.2134
(2) - (3) - (4) + (6)      AB            0.3516     2      0.1758
(1) - (2) - (5) + (6)      Error         3.1100     5      0.6220

(1) - (6)                  Total         6.0667    11


Example (interaction AB):
$Q = P_2 - P_3 - P_4 + P_6 = 0.3516$
$q = p_2 - p_3 - p_4 + p_6 = 2$
To give an example, we consider a 3 × 2 factorial experiment in 2
blocks. The factor A has the three levels $A_0$, $A_1$, and $A_2$. The factor
B has the two levels $B_0$ and $B_1$. Table 1 shows the data of the ex-
periment. The computation of the sums P is given, together with
the numbers p, in Table 2. Finally, the sums of squares, and degrees
of freedom, as derived by linear operations on P and p, respectively,
are shown in Table 3. The example is only intended as an illustration
of the computational technique. The procedure is, of course, more
useful in complicated cases.
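
The routine can be sketched in Python as follows. This is an illustration under stated assumptions, not the author's own computing scheme: the helper names are hypothetical, and the twelve observations are those of Table 1 as recovered from its row and column totals. Each grouping yields a pair (P, p), and every line of the analysis of variance applies one set of coefficients to both members of the pair.

```python
# Observations keyed by (level of A, level of B): (Block I, Block II).
data = {
    (0, 0): (2.7, 2.9), (0, 1): (2.0, 3.6),
    (1, 0): (3.1, 3.3), (1, 1): (3.8, 2.7),
    (2, 0): (2.5, 4.3), (2, 1): (4.3, 4.0),
}


def grouped(key):
    """Return (P, p): P = sum of (group total)^2 / (group size), p = number of groups."""
    totals, counts = {}, {}
    for (a, b), obs in data.items():
        for block, y in enumerate(obs):
            k = key(a, b, block)
            totals[k] = totals.get(k, 0.0) + y
            counts[k] = counts.get(k, 0) + 1
    big_p = sum(t * t / counts[k] for k, t in totals.items())
    return big_p, len(totals)


rows = {
    1: grouped(lambda a, b, blk: (a, b, blk)),  # individual observations
    2: grouped(lambda a, b, blk: (a, b)),       # AB
    3: grouped(lambda a, b, blk: a),            # A
    4: grouped(lambda a, b, blk: b),            # B
    5: grouped(lambda a, b, blk: blk),          # blocks
    6: grouped(lambda a, b, blk: None),         # total
}


def combine(coeffs):
    """Apply the same linear operation to the P's and the p's."""
    big_q = sum(g * rows[i][0] for i, g in coeffs.items())
    small_q = sum(g * rows[i][1] for i, g in coeffs.items())
    return big_q, small_q


anova = {
    "Blocks": combine({5: 1, 6: -1}),
    "A": combine({3: 1, 6: -1}),
    "B": combine({4: 1, 6: -1}),
    "AB": combine({2: 1, 3: -1, 4: -1, 6: 1}),
    "Error": combine({1: 1, 2: -1, 5: -1, 6: 1}),
    "Total": combine({1: 1, 6: -1}),
}
```

With unrounded arithmetic the AB line comes out as Q ≈ 0.3517 with q = 2; Table 3 shows 0.3516 because it works from the P's rounded to four decimals. Keeping P and p side by side in one tuple is what makes the degrees of freedom fall out of the same subtraction as the sum of squares.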


CORRECTIONS TO PAPERS 


CHIN LONG CHIANG. An Application of Stochastic Processes to 
Experimental Studies on Flour Beetles. Biometrics, 13(1), 79-97, 1957. 

On page 88, in line 20, 6 = (6,, 6,, ---, 6,) should be read as 9 =
(9: , 82, **-, 4); in line 29, ¢,(@) and in line 30, ¢;(6) should both be
read as £;(8). On page 90, in equation (48), d,, should be read as d,, .


F. J. ANSCOMBE. Fixed-Sample-Size Analysis of Sequential Ob- 
servations. Biometrics, 10(1), 89-100, 1954. 


The two consecutive sentences half-way down p. 100, in which the
names of Fisher and Wald are mentioned, should be deleted. It is
true that in some instances a distribution obtained by Fisher’s fiducial
argument, or a decision function obtained by Wald’s minimax rule,
could have been derived from Bayes’ theorem. But I had not ap-
preciated, when writing this paper, that the “prior” distribution that
must be inserted in Bayes’ theorem in order to obtain Fisher’s or Wald’s
result, may itself depend on the sampling rule. The matter is dis-
cussed in a paper, “Dependence of the fiducial argument on the sampling
rule,” to appear in Biometrika, 44 (1957).


H. R. THOMPSON. Extensions to Missing Plot Techniques. Bio-
metrics, 12(3), 241-244, 1956.


I am grateful to George Hirschhorn for pointing out an arithmetical
error in one element of the matrix on page 244 of my paper referenced
above. The leading element should be 43470 and not 44345. All
entries in the matrix were obtained from the solution for p = 3 and
divided by 4 to reduce the number of digits. The missing values in
Table I are correct, however, as they were calculated using the original
correct matrix.
ABSTRACTS


Papers presented at the meeting of the Biometric Society (W.N.A.R.) with the American 
Institute of Biological Sciences at Stanford University, Stanford, California, August 
27 and 28, 1957. 


ROSEDITH SITGREAVES (Teachers College, Columbia Uni- 
versity). Discriminating Analysis in a Non-Classical Case. 


In the classical theory of discriminatory analysis, a linear function
(i.e., Fisher’s discriminant function) of a set of measurements for a
given individual is used in classifying the individual as belonging to one
of two populations. This function has been shown to possess some
optimal properties if the measurements are continuous, and are normally
distributed in each population with the same covariance matrix. Some
recent work of Bahadur has been concerned with the problem of classi-
fication when the measured variables are all dichotomous, and he has
developed some approximations to the optimal classification statistic
for this case. In the present paper, a probability model is constructed
for an experimental situation in psychology involving “aggressive”
and “non-aggressive” children, and Bahadur’s results are used to de-
termine an appropriate classification index. This index is evaluated
in terms of the probabilities of misclassification. Approximations to
the probability distributions and the resulting classification index are
also considered.


451 ALBERT C. WALKER (International Minerals and Chemical
Corporation, Woodland, California). Canonical Vectors among
Sugar Beet Varieties.


A canonical analysis of the results of an experiment on thirteen 
varieties of beets is presented. Nine of these had been chosen because 
of high sucrose content. Highly significant differences were present 
both in three commercial characteristics, root weight, sucrose, and 
glutamate content, and in the three auxiliary measurements of top 
weight, sodium, and amino acids. Although there was considerable 
correlation between some of these variables, no reduction in the number 
of dimensions necessary to depict the varietal differences could be 
obtained. The canonical vectors obtained are interpreted as the best
available measures of the underlying genetic differences. The order of 
importance, for this experiment, was found to be (1) vigor, or all-over 
utilization of energy, (2) distribution of energy between growth and 
storage in the form of sucrose or glutamate, (3) nitrogen foraging 
ability, (4) distribution of growth between root and top, and (5) a 
curious positive correlation between sodium and sucrose, in two varieties 
known to be carrying a gene causing albinism. 


WILLIAM F. ROYCE (U.S. Fish and Wildlife Service, Juneau).
Statistical Comparisons of Morphological Data.


The methods of statistical comparison of morphological data are 
examined with special regard to techniques suitable for racial studies 
of tunas. Comparison of samples by means of a test of significance is 
discarded because it leads to a trivial conclusion. In its place is sug- 
gested and defined a measure of overlap of frequency distributions. 

The computation of overlap for single counted characters is extended 
to single measured characters through the use of regression analysis 
and to multiple characters either counted or measured or both. 

The general concept of overlap is related to the proportion of one 
sample which might belong to another specified sample of equal size. 
This is considered to be a maximum for any intermingling which might 
have occurred. It is pointed out that such a maximum is useful when 
considered in conjunction with tagging returns. 


HARLEY B. MESSINGER (University of California, Berkeley).
Clinical Trials in Breast Cancer, Statistical Problems.


The commonest measures of treatment effectiveness in cancer are 
based on survival time after diagnosis or treatment. In the case of 
breast cancer, there is a strong dependence of survival time upon the 
age of the patient. Actuarial analysis of the excellent data of the 
Connecticut Cancer Registry reveals that this age effect can be removed 
by dividing the survival percentage at each point in time after diagnosis 
or treatment by the percentage that would be expected to survive to 
that point in time among a group of people of the same age, sex, and 
race from the general population. 
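
The age correction described above amounts to a simple ratio at each follow-up time. A minimal Python sketch follows; the function name and the percentages in the usage lines are hypothetical and are not taken from the Connecticut Cancer Registry data.

```python
def relative_survival(observed_pct, expected_pct):
    """Divide observed survival by expected survival at each follow-up time.

    observed_pct: percent of patients surviving at each time point.
    expected_pct: percent expected to survive to the same points in a
    general-population group of the same age, sex, and race.
    """
    if len(observed_pct) != len(expected_pct):
        raise ValueError("both series must cover the same time points")
    return [obs / exp for obs, exp in zip(observed_pct, expected_pct)]


# Hypothetical figures for two follow-up times, for illustration only.
ratios = relative_survival([80.0, 60.0], [100.0, 80.0])
```

A ratio of 1.0 would mean that the patients survive as well as comparable members of the general population.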

This fact can be utilized, for example, in the Bross sequential method 
of treatment comparison to avoid the need for matching pairs of patients 
by age (see Billewicz’ article in Biometrics 12:283, 1956). In each un- 
matched pair, one would simply observe which patient lived for the 
greater fraction of her expected survival time. 
The common fractions obtained with above correction from the
various age groups by weighted averaging increase with the log of time 
like a cumulative normal distribution function. The lognormal distri- 
bution has been fitted before to uncorrected survival curves, but this 
technique has worked well only in cases where the mortality from cancer 
was so high that normal mortality was negligible, e.g. lung cancer. 

Further, there are indications that the lognormal distribution de- 
scribes well the distribution of intervals from treatment until the 
appearance of metastases as well as certain other similar classes of 
intervals which might have utility as measures of treatment effective-
ness. Thus, it may be possible to combine other measures of effective-
ness with survival time by the use of multivariate normal techniques
to permit much earlier decisions to be made regarding the relative 
merit of two or more treatments. 

454 MOHAMED S. AHMED (University of California, Berkeley).
Test that the Probability of Disease is Independent of the Total
Length of Time Previously Suffered from the Disease.


Consider an individual who has suffered from a disease for a total
length of time $\tau$. Let $P(t; \tau)$ be the probability that, if he contracts
the disease again, he remains sick for at least time t. Assume that
$P'(t; \tau)|_{t=0} = -q(\tau)$. The distribution of the total time T spent in
sickness in n successive returns is obtained. Let $q(\tau) = a + 2b\tau$, with
a and b non-negative and a known. We test the hypothesis of inde-
pendence, b = 0. The test criterion is to reject the hypothesis of
independence whenever $T < k(n, \alpha)/a$, where $k(n, \alpha)$ is adjusted in
accordance with the desired level of significance $\alpha$. The test is uni-
formly most powerful for the restricted class of alternatives
$b > a^2 k(n, \alpha)$. A formula for the power function is given. Similar results
can be obtained for other situations, for example, $q(\tau) = a + 2b\tau + 3c\tau^2$.
The problem is more complicated when more than one individual is
considered. A method for the estimation of parameters is given and
asymptotically similar tests are worked out for this case. Other appli-
cations include (i) accident-free flying experience of pilots and (ii)
tunnelling of flour beetles.


455 W. F. TAYLOR (School of Aviation Medicine, Randolph Field,
Texas). Some Monte Carlo Methods Applied to an Epidemic of
Acute Respiratory Disease.


A probabilistic model and its deterministic counterpart are developed 
to describe the course of acute respiratory disease among recruits at 
the former Air Force reception center at Sampson Air Force Base, New
York. Four parameters are estimated and the model is evaluated 
by comparing observed epidemics which occurred among 117 groups of 
recruits with 415 analogous Monte Carlo epidemics resulting from using 
the model. Computations were performed with the aid of a CRC 102A 
electronic computer. 

The following assumptions were made in building the model: 

1. Sources of infection exist both from within and without the 
group. 

2. The risk of a susceptible man becoming infected from within the
group in a small increment of time, $\Delta t$, is proportional to the number of
infectious men, I, within the group. Risk from within = $\lambda I$.

3. The risk of infection from without the group is constant. Risk
from without = $\mu$.

4. Once recovered from infection, a man has partial immunity, and
the constants $\lambda$ and $\mu$ in 2 and 3 above are both altered by multiplying
them by the same factor. Risk from within = $\alpha\lambda I$. Risk from with-
out = $\alpha\mu$.

5. There exists an unknown proportion, $\beta$, of men who become
infected but are not detected. These men remain in the group and
provide the pool of infection from within.


J. GURLAND AND S. K. KATTI (Iowa State College, Ames,
Iowa). Generalized Families of Contagious Distributions.


The notion of compound and generalized distribution is utilized to 
obtain the probability generating functions of certain families of con- 
tagious distributions. These families contain the family of Beall and 
Rescia (Biometrics, 1953) as a special case. Properties of the confluent 
hypergeometric function are applied to obtain formulae for computing 
probabilities. Some limiting cases of the families of distributions are 
also considered when certain of the parameters are allowed to become 
infinite. In particular, it is shown that the Neyman Type A and the 
generalized Polya-Aeppli distributions emerge as the limiting form in 
some cases. Different methods of estimation, including the method of 
moments and estimation by frequencies, are also considered in a pre- 
liminary investigation of some data. 


457 DOUGLAS G. CHAPMAN (University of Washington, Seattle, 
Washington). Problems of Estimation of Wildlife Mortality Rates. 


The estimation of mortality rates of wild animals in general has been 
based on one of two methods—recoveries of tagged or marked members 
of the population and sampling of the population to determine its age
structure at one or more time periods. The sources of bias in these 
methods are discussed. Several different formulae are reviewed and a 
new formula given which may be useful for those fish and animals 
where the largest cause of mortality is predation by man. The effect 
of the various sources of bias on the formulae is noted. Consideration 
is also given to the determination of estimates of the variances of the 
estimates; it will usually be preferable to find internal rather than 
external estimates of such variances. 


458 B. O. BERGH (University of California, Riverside, California). 
Dominance Estimates from Tomato Populations Segregating for 
Locule Number. 


Tests of scale adequacy indicated the desirability of a logarithmic 
transformation of the data. Despite some evidence that this trans- 
formation was not strong enough, it was statistically acceptable ac- 
cording to the tests used. 

Estimates of the average absolute dominance of the segregating 
genes were made by two methods. That of Fisher, Immer, and Tedin 
[1932], amplified by Mather [1949], gave highly variable and statis- 
tically unreliable values. However, the results obtained indicate that 
the method developed by Comstock and Robinson [1952] was an 
efficient way to estimate genic dominance in these populations. 

Two major loci were found to be largely responsible for locule 
number differences in these populations. It was possible to assign 
genotypic values to each of the nine major locule number genotypes. 
From these values, a theoretical mean absolute dominance estimate 
could be calculated, based on the two major genes only. This estimate, 
0.66, was not markedly different from the 0.60 estimate obtained for 
all segregating genes by the method of Comstock and Robinson. Sim- 
ilarly, the potence estimate from the two major genes, 0.42, agrees 
closely with the estimate from the non-segregating generations, 0.43. 


459 W. A. BECKER (State College of Washington, Western Washington 
Experiment Station, Puyallup, Washington). Computing 
Mortality in Experiments with Irrelevant Deaths. 


When computing mortality due to a specific cause, a, the experimenter 
has to make an adjustment for individuals that died of all other 
causes, b. Let us assume that a and b are independent and no individuals 
die of joint causes a and b. The mortality due to a can be computed by 
calculating the proportion of individuals that did not die of a during 
each period of time, x, say the 24 hour periods in a yearly experiment. 
These proportions are multiplied and the product subtracted from one 
to give the desired mortality figure. A certain sample size has to be 
maintained during the experiment. 
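
The calculation described above can be sketched as follows. The function name and the data are hypothetical, and the sketch assumes that the number at risk in each period already excludes individuals lost earlier to the other causes, b.

```python
def cause_specific_mortality(at_risk, deaths_from_a):
    """Mortality due to cause a over the whole experiment.

    at_risk[x] is the number alive at the start of period x (those that
    died of other causes in earlier periods are no longer counted), and
    deaths_from_a[x] is the number dying of cause a during period x.
    """
    surviving = 1.0
    for n, d in zip(at_risk, deaths_from_a):
        if n <= 0:
            raise ValueError("each period needs at least one individual at risk")
        surviving *= 1.0 - d / n  # proportion that did not die of a in this period
    return 1.0 - surviving


# Hypothetical three-period experiment.
mortality_a = cause_specific_mortality(at_risk=[100, 95, 90], deaths_from_a=[2, 1, 3])
```

The guard on the number at risk reflects the remark that a certain sample size has to be maintained: a period with no survivors leaves the period proportion undefined.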


460 HANS ABPLANALP (University of California, Davis, California). 
Effective Population Size Under Selection. 


Effects of artificial selection procedures on the relation between 
actual and effective population size are studied. The analysis is based 
on data from two generations of eleven closed chicken breeding flocks. 
Estimates of effective population size are based on methods derived by 
Crow and Morton (1954, Evolution 9: 202-214), contrasting expected 
sampling variance in gene frequency of an ideal population of N, in- 
dividuals with an estimate of the sampling variance derived from the 
data. 

Four selection procedures are applied to the same breeding records, 
contrasting natural selection to 10 months of age, mass selection for 
a trait with high heritability (body weight), and two selection pro- 
cedures for a trait with low heritability (egg production) where family 
averages are used to supplement information from individual pro- 
duction records. 


461 R. W. ALLARD (University of California, Davis, California). 
Effect of Genotypic-Environmental Interactions on the Prediction 
of Genetic Advance under Selection. 


Earliest and latest plants were selected in segregating generations 
of the wheat hybrid Baart × Ramona to determine the progress possible 
under different intensities of selection. In a parallel series of experi- 
ments, estimates were made of various genetic and environmental 
components of variance. Predictions of genetic advance under selection 
based on these estimates often did not correspond closely to the actual 
genetic advance. Although conspicuous interactions between genotype 
and environment occurred in these materials, these interactions did not 
appear to be the main cause of the failure of the predictions. The 
disturbing effect of a major gene on the selection differential, and non- 
allelic gene interactions, unaccounted for in the prediction equation, 
were advanced as more probable explanations for the discrepancy. 

462 E. NOVITSKI AND E. DEMPSTER (Oak Ridge National 
Laboratory, Oak Ridge, Tennessee, and Department of Genetics, 
University of California, Berkeley, California). Analysis by 
Automatic Computer of Data from Laboratory Populations of 
Drosophila Melanogaster. 


In some cases heterosis (hybrid vigor) is inferred because a popula- 
tion appears to contain more heterozygotes than can be accounted for 
by ordinary genetic theory. A mathematical model based on a maxi- 
mum likelihood formulation has been set up that allows an estimate to 
be made of the degree to which irregularities in genetic data require 
the assumption of heterosis. The analysis based on such a model is 
not practicable by ordinary methods, and for this reason an automatic 
computer (the Oracle at Oak Ridge National Laboratory) was used. 
The output of the computer showed that the assumption that heterosis 
is responsible for the increased frequency of the heterozygotes is un- 
warranted, and that a reduced viability of homozygous classes will 
equally well explain such data. This result is of some interest because 
it shows that one of the most commonly applied tests for heterosis, a 
statistical test for an excess of heterozygotes in a population, is actually 
ambiguous. 


463 L. H. KOOPMANS AND DOROTHY C. LOWRY (University 
of California, Berkeley, California). Estimation of Mean 
and Variance of a Distribution of Binomial Probabilities. 


The following binomial probability situation is frequently encountered. 
Each individual in a population P has associated with it 
a probability p which reflects the individual's potential for inducing 
a characteristic C in each of a number of items it produces. Suppose 
that the distribution function of these probabilities is $F(p)$ with mean 
and variance $\pi$ and $\sigma_p^2$. A random sample of k individuals is selected 
from P. For each i (i = 1, 2, ..., k), $n_i$ items from the production of 
the individual are chosen at random and are tested for the presence of 
the characteristic C. On the basis of the resulting numbers of items 
possessing C we have derived unbiased estimates of $\pi$ and $\sigma_p^2$. A natural 
estimate of $\pi$ is 

$$\hat{\pi} = \frac{1}{n} \sum_{i=1}^{k} X_i ,$$

where $n = \sum_{i=1}^{k} n_i$ and $X_i$ is the observed number of items possessing 
the characteristic C from the i-th individual. An unbiased estimate of 
$\sigma_p^2$, $\hat{\sigma}_p^2$, is 

$$\hat{\sigma}_p^2 = \left[ \frac{(n-k)\left(n^2 - \sum_{i=1}^{k} n_i^2\right)}{n(n-1)} \right]^{-1} \left[ \sum_{i=1}^{k} \frac{X_i^2}{n_i} - n\hat{\pi}^2 - \frac{k-1}{n-1}\, n\hat{\pi}(1-\hat{\pi}) \right].$$

Exact expressions were derived for the variances of $\hat{\pi}$ and $\hat{\sigma}_p^2$. 
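
The sketch below checks unbiasedness numerically for one moment-based pair of estimators under the model just described: the pooled proportion for the mean of p, and a corrected weighted sum of squares for the variance of p. The variance estimator coded here is an assumption made for illustration and is not necessarily the authors' expression; the two-point distribution of p and the item counts are hypothetical.

```python
from itertools import product
from math import comb


def estimates(x, n_items):
    """Return (pi_hat, var_hat) from counts x[i] out of n_items[i] items."""
    k, n = len(x), sum(n_items)
    pi_hat = sum(x) / n
    # Weighted sum of squares of the individual proportions about pi_hat.
    s = sum(xi * xi / ni for xi, ni in zip(x, n_items)) - n * pi_hat ** 2
    # Remove the binomial (within-individual) part of its expectation.
    within = (k - 1) * n / (n - 1) * pi_hat * (1.0 - pi_hat)
    scale = (n - k) * (n * n - sum(ni * ni for ni in n_items)) / (n * (n - 1))
    return pi_hat, (s - within) / scale


def expected_estimates(n_items, p_values, p_weights):
    """Exact expectations of both estimators, by enumerating every outcome."""
    def marginal(ni):
        # Marginal distribution of X_i when p is drawn from the discrete mixture.
        return [sum(w * comb(ni, x) * p ** x * (1 - p) ** (ni - x)
                    for p, w in zip(p_values, p_weights))
                for x in range(ni + 1)]

    margins = [marginal(ni) for ni in n_items]
    e_pi = e_var = 0.0
    for xs in product(*[range(ni + 1) for ni in n_items]):
        prob = 1.0
        for m, x in zip(margins, xs):
            prob *= m[x]
        pi_hat, var_hat = estimates(xs, n_items)
        e_pi += prob * pi_hat
        e_var += prob * var_hat
    return e_pi, e_var


# Hypothetical case: p is 0.2 or 0.7 with equal chance, so the mean of p
# is 0.45 and its variance is 0.0625; three individuals give 2, 3 and 4 items.
mean_pi_hat, mean_var_hat = expected_estimates([2, 3, 4], [0.2, 0.7], [0.5, 0.5])
```

The variance estimator needs at least two individuals and more items than individuals; otherwise the scale factor is zero and the variance of p cannot be separated from binomial sampling error.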


464 ALBERT C. WALKER (International Minerals and Chemical 
Corporation, Woodland, California). Balanced Incomplete Diallel 
Crosses. 


Progeny testing of genotypic values can be extended to a wider 
range of parents by use of an incomplete diallel. Comparison may be 
equalized and calculation simplified by application of the balanced 
incomplete block technique. A table of the number of males, number 
of females, and number of crosses per parent (complete to 200 pro- 
genies), necessary for balanced comparison, is presented. When the 
number of males is different from the number of females, only the 
smaller set gives complete balance. Cases where the larger set affords 
partial balance with only two associate classes are indicated. 

Calculations of best estimates of parental values and their variances 
are detailed, with two examples involving glutamate content of pro- 
genies from sugar beet diallel crosses. 


THE BIOMETRIC SOCIETY 


REGIONAL 
E.N.A.R. 


The Tenth Annual Meeting was held in Atlantic City, September 
10-13, 1957. This was also a joint meeting with the American Statis- 
tical Association and the Institute of Mathematical Statistics. 

Some of the session subjects and speakers were: 


A. Methodology and Application of Response Surface Exploration: 
J. S. Hunter, R. M. DeBaun; 

B. Forecasting Crop Yields: G. M. Kuznets, Y. Mundlak, H. F. 
Huddleston, R. Parr, L. Calvin; 

C. Stochastic Processes in Biology: J. G. Huffman, E. Fix, E. K. 
Harris; 

D. Genetic Statistics: W. R. Harvey, D. S. Robson, O. Kempthorne; 

E. Fractional Replication: R. L. Anderson, J. D. Hromi, M. 
Carroll, C. Daniel; 

F. Psychometric Statistics: H. Gulliksen, F. M. Lord; 

G. Tolerance Limits: J. E. Walsh, E. A. Thomas, A. Lieberman; 

H. Clinical and Field Trials in Public Health and Medicine: F. M. 
Hemphill, S. W. Greenhouse, J. Williams, P. Armitage; 

I. Diet and Heart Disease: F. Moore, M. Robins, E. A. Lew. 


Attendance at several sessions was 150 to 200. The annual business 
meeting was also held in Atlantic City. The minutes follow this report. 


E.N.A.R. Minutes of Tenth Annual Business Meeting 


The Tenth Annual Business Meeting of the Eastern North American 
Region of the Biometric Society was called to order at 2:00 p.m., Sep- 
tember 12, in the Ambassador Hotel, Atlantic City. The presiding 
officer was Boyd Harshbarger, the Regional President. There were 
forty-four persons in attendance. The minutes of the Ninth Annual 
Business Meeting were read and approved. 

S. M. Free, Chairman of the Regional Advisory Board, reported on 
a recommendation by that body that the ENAR By-laws be revised 
to make provision for a Regional President-elect. The reasons advanced 
by the RAB for the possible revisions were: 

1. It would improve the continuity of the office of regional president. 
At present, a president is usually returned to office for a second term. 
At the end of that term a completely uninitiated person takes over the 
position. 

2. The presidential honor (and duties) would be given to more 
people. 

3. The president-elect could provide a needed over-all program 
chairman for ENAR’s many meetings. 

The amendments proposed by the RAB are as follows: 


1. By-law 3. Regional Committee 
Change the first sentence to read: 
“The Regional Committee shall consist of the Regional Presi- 
dent who shall serve as its chairman, the Regional President- 
elect, the Regional Secretary-Treasurer, and six ordinary members 
who shall serve three-year terms.” 
2. By-law 7. Program Committees 
Delete present By-law and substitute the following: 
“The Regional President shall appoint program committees to 
arrange meetings in connection with those of specific organizations 
or groups of organizations. The Regional President-elect 
shall act as program coordinator and shall establish appropriate 
liaison with program committees of organizations with whom the 
Region meets. He shall report the tentative program plans for 
the coming year at the Annual Meeting.” 
3. By-law 8. Election 
Change the first sentence to read: 
“On or before August 1 each year the Regional Advisory Board 
shall submit nominees for Regional President-elect, Regional 
Secretary-Treasurer, and two nominees for each vacancy among 
ordinary members of the Regional Committee to the Regional 
Secretary-Treasurer.” 
4. By-law 8. Election 
Add the following sentence: 
“The Regional President-elect shall serve for one year and at the 
close of this term automatically become Regional President, 
subject to confirmation by the Council of the Society.” 


These amendments were voted on by those attending the meeting. 
The amendments passed without a dissenting vote. 

It was also moved that the ENAR secretary be permitted to cir- 
culate ballots by mail for the election of a Regional President-elect for 
1958. The RAB had already nominated for this position and it was 
suggested that the ENAR secretary should send these ballots to the 
membership as soon as practicable after the present meeting. 

A short report on the plans for each of the following meetings was 
given: 


Gatlinburg meeting on April 10, 11, 12, 1958. 

Indianapolis meeting after Christmas, 1957. 

The Fourth International Biometric Conference and International 
Symposium on Biometrical Genetics to be held in Ottawa August 
28 to September 2, 1958. 


It was moved that the Chicago meeting with the ASA December 
27-30, 1958, be designated the Eleventh Annual Meeting of ENAR. 

Ralph Bradley was introduced as the new editor of Biometrics. He 
said that the June issue would be out soon, the September issue on the 
Analysis of Covariance would be late, and the December issue would be out 
on time. 

C. I. Bliss offered the following resolution to be suitably communi- 
cated to John W. Hopkins: Resolved that the Eastern North American 
Region of The Biometric Society express its sincere appreciation for the 
interest, enthusiasm, and leadership contributed by John W. Hopkins 
as editor of Biometrics and in his service to the Region in that capacity. 

The resolution was unanimously subscribed to by the meeting. 

R. G. Hoffman initiated a discussion concerning the problem of 
communication and interchange with the Medical Sciences. It was 
suggested that we should continue meeting with the FASEB and 
initiate further meetings with the medical clinical groups. It was 
suggested that perhaps a committee should be set up to implement 
such meetings. After considerable discussion of initiating further 
meetings of ENAR it was moved, seconded and passed that the Re- 
gional President should appoint a committee to explore and arrange 
with the advice and agreement of the Regional President-elect the 
feasibility of a joint meeting with the American Medical Association, 
the American College of Surgeons, or a similar group at the earliest 
effective and satisfactory date. 

The meeting was adjourned at 3:05 p.m. 

Respectfully submitted, 
Arthur M. Dutton 

Belgian Region 

On October 25, 1957, the Société Adolphe Quetelet had the pleasure and 
honor of welcoming as speakers Professor Jerzy NEYMAN of the 
University of California, who was passing through Belgium, and Professor 
Robert CONSAEL of the University of Brussels. The session was 
held at the Institut Agronomique de l'État at Gembloux and brought together 
a large audience of professors and students as well as 
members of the Society, in the presence of Mr. HESPEL, Rector of 
the Institute. Professor NEYMAN's lecture dealt with 
original work that he has developed in collaboration with Miss 
SCOTT on the probabilistic theory of the development of biological 
populations regarded as conglomerations of families. This brilliant 
presentation, given in French, was followed by a lively discussion led by 
the President of the Société Adolphe Quetelet, Mr. R. LAURENT. 
Professor CONSAEL then spoke on the Poisson process with one 
or two random variables. In the discussion that followed, Professor 
NEYMAN brought out the kinship between that presentation 
and his own, stressing the practical importance of both. 


OFFICERS 
E.N.A.R. 


The following officers and committee members were elected by mail 
ballot by the members of the Eastern North American Region: 
Regional President for 1958, Boyd Harshbarger; President-elect for 
1958, Jerome Cornfield; Secretary, T. W. Horner; Regional Committee 
for 1958-1960, D. B. Duncan and H. O. Hartley. The President-elect, 
who will become President in 1959, is a new officer. One of 
the duties of this officer will be to handle the meetings and programs 
of the E.N.A.R. 


W.N.A.R. 


The Regional officers for 1958 are as follows: President, J. L. 
Hodges, Jr.; Secretary-Treasurer, Miss M. M. Sandomire; Regional 
Committee: 1956-1958, Bernice Brown, L. Calvin; 1957-1959, W. J. 
Dixon, J. C. R. Li; 1958-1960, D. G. Chapman and D. Wohlschlag. 


NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary 
(or, for members at large, to the General Secretary) news of appointments, 
distinctions, or retirements and announcements of professional interest. 


Marcus Kjelsberg, University of Minnesota, who spent the last year 
as Special Research Fellow in the Epidemiological Research Unit of 
the Cardiovascular Department of Michael Reese Hospital in Chicago, 
has joined the staff of the Department of Tropical Medicine and Public 
Health, School of Medicine, Tulane University as Instructor in Bio- 
statistics. 


MEETING OF THE BIOMETRIC SOCIETY, E.N.A.R. 


The Biometric Society (E.N.A.R.) will meet jointly with the In- 
stitute of Mathematical Statistics at Gatlinburg, Tennessee, Thursday, 
Friday, and Saturday, April 10, 11, and 12, 1958. Titles and abstracts, 
the latter in duplicate in the form published in Biometrics, for con- 
tributed papers for the Biometric Society should be sent to Donald A. 
Gardiner, Oak Ridge National Laboratory, Post Office Box Y, Oak 
Ridge, Tennessee, not later than March 10, 1958. 


SYMPOSIUM ON BIOMETRICAL GENETICS AND THE 

FOURTH INTERNATIONAL BIOMETRICAL CONGRESS 

The Symposium on Biometrical Genetics and the Fourth International 
Biometrical Congress will meet in Ottawa, Canada, from 
August 28 to September 2, 1958. May 1 is the deadline for pre-registration 
and for receiving papers and manuscripts. Queries regarding 
programmes and contributed papers are to be sent to Mr. M. J. R. 
Healy, Secretary, The Biometric Society, Rothamsted Experimental 
Station, Harpenden, Herts., England; queries concerning local arrangements 
should be sent to Mr. G. B. Oakland, Statistical Laboratory, 
Science Service Building, Central Experimental Farm, Ottawa, Canada. 


NATIONAL SCIENCE FOUNDATION TRAVEL AID 


The National Science Foundation will award individual grants to 
defray partial travel expenses for a limited number of American scientists 
participating in the Special Session of the International Statistical 
Institute to be held in Brussels, Belgium, September 1-6, 1958. 

Application blanks may be obtained from the National Science 
Foundation, Washington 25, D.C. Completed Application Forms must 
be submitted by March 1, 1958. 


NEW COOPERATIVE PROGRAM IN BIOSTATISTICS 


The Virginia Polytechnic Institute, Blacksburg, Virginia, and the 
Medical College of Virginia, Richmond, Virginia, have initiated a 
cooperative graduate training program in biostatistics through a grant 
from the National Institutes of Health. This program will be supported 
by the combined facilities of the two institutions and is designed to 
meet in part the critical demand for statisticians in the medical and 
public health professions. It is proposed to train both M.S. and Ph.D. 
candidates in statistics as well as to give post doctorate training in the 
field of statistics to holders of Ph.D. degrees in related fields. 

Most of the course work will be given at the Virginia Polytechnic 
Institute, but certain specific courses relating to medical and public 
health statistics will be taken by the candidates at the Medical College 
of Virginia in Richmond. It is expected that in general a doctoral 
candidate will spend one year of his graduate study at the Medical 
College of Virginia. 

To carry on this program, a staff of nine in the Department of 
Statistics at the Virginia Polytechnic Institute is being increased by one 
professor specifically trained in biostatistics, and the Medical College 
is adding a professor of biostatistics who will supervise the work in 
statistics at that division. The fellowships for candidates 
for graduate degrees now existing at the Virginia Polytechnic Institute 
are being increased in number as well as in amount to take care of this 
new field of study in biostatistics. In addition to these fellowships, 
post doctorate fellowships are available under this new program for 
students in biology, medicine, and other related fields who wish to 
study statistics. 